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CHAPTER  1 


INTRODUCTION 


1.1  Abstract 

In  this  thesis*  we  develop  single  stage  (fixed  sample  size)  asympto¬ 
tically  optimal  (minimax)  procedures  for  ranking  populations  in  the  presence 
of  nuisance  parameters,  when  the  populations  are  ranked  according  to  a 
parameter  of  the  distribution  and  the  so-called  indifference- zone  approach 
to  ranking  and  selection  problems  is  employed.  We  adapt  methods  proposed 
by  Weiss  and  Wolfowitz  in  developing  asymptotically  optimal  (minimax)  pro¬ 
cedures  for  a  certain  class  of  2-decision  tests  of  composite  hypotheses 
problems  in  the  presence  of  nuisance  parameters  to  multiple  decision  ranking 
and  selection  problems  in  the  presence  of  nuisance  parameters. 

For  the  problem  of  selecting  the  "best"  population,  asymptotically 
optimal  procedures  are  developed  for  situations  in  which  the  joint  density 
function  of  the  observations  satisfies  certain  mild  regularity  conditions 
(similar  to  those  imposed  by  Weiss  and  Wolfowitz).  The  method  of  analysis 
and  basic  theory  is  developed  in  detail  for  this  case.  The  basic  results 
are  extended  to  develop  asymptotically  optimal  procedures  for  certain 
other  ranking  goals  considered  in  the  literature.  Some  examples  are 
included  to  illustrate  the  applicability  of  the  results  to  specific 
distributions. 

For  ranking  and  selection  problems  with  joint  density  function  of 
observations  not  satisfying  the  regularity  conditions,  i.e.,  non-regular 
cases,  we  illustrate  the  applicability  of  the  basic  method  by  developing 
asymptotically  optimal  procedures  for  ranking  non-regular  exponential  and 

thesis  submitted  in  partial  fulfillment  of  the  requirements  for  the  degree 
of  Doctor  of  Philosophy  in  the  Field  of  Operations  Research,  Cornell 
University,  June  1970. 
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uniform  distributions. 

The  results  in  this  thesis  can  be  thought  of  as  generalizing  the 
basic  method  of  Weiss  and  Wolfowitz  for  2-decision  hypothesis  testing 
problem  to  multiple  decision  ranking  and  selection  problems.  As  a  con¬ 
sequence  of  our  results,  we  show  the  asymptotically  optimal  character  of 
certain  so-called  natural  selection  procedures  which  already  have  been 
proposed  in  the  literature.  We  also  develop  single-stage  asymptotically 
optimal  procedures  for  certain  problems  for  which  heretofore  no  single- 
stage  procedures  had  been  proposed. 

1.2  Outline  of  the  Thesis 

In  Section  1.1,  we  have  given  an  overview  of  the  problem  considered 
below  and  of  the  results  obtained.  In  the  present  section  we  outline  the 
contents  of  the  various  chapters. 

In  Section  1.3  we  give  a  brief  introduction  to  ranking  and  selection 
problems.  In  Section  1.4,  we  introduce  the  basic  method  proposed  by  Weiss 
and  Wolfowitz  [5S]  in  developing  asymptotically  minimax  tests  of  composite 
hypothese;  In  Section  1.5,  we  point  out  that  by  treating  Ranking  and 
Selection  problems  in  the  framework  of  statistical  decision  theory,  the 
basic  method  of  Weiss  and  Wolfowitz  can  be  extended  to  develop  asympto¬ 
tically  optimal  ranking  procedures. 

In  Chapter  2,  we  consider  the  problem  of  selecting  the  "best"  popula¬ 
tion.  The  notation  used  throughout  the  thesis  is  defined.  Mild  regularity 
conditions  imposed  on  the  density  functions  are  specified.  Some  preliminary 
results  of  statistical  decision  theory  are  included.  Tho  ranking  problem 


is  structured  as  a  decision  theory  problem  and  an  asymptotically  optimal  pro¬ 
cedure  for  a  particular  zero-one  loss  function  is  obtained.  The  rate 
of  convergence  of  the  decision  variables  to  the  asymptotically  normally 
distributed  variables  is  studied.  The  general  results  are  applied  to 
certain  specific  distributions,  and  asymptotically  optimal  procedures 
noted  in  each  case.  Large  sample  applications  of  our  results  are 
illustrated  by  indicating  how  the  procedure  would  be  used  in  ranking 
means  of  normal  population  with  common  unknown  variance. 

In  Chapter  3,  we  extend  our  basic  method  to  develop  asymptotically 
optimal  procedures  for  certain  other  ranking  goals.  Asymptotically  optimal 
procedures  are  developed  for  the  problem  of  selecting  a  fixed-size  subset 
to  contain  the  best  population  and  for  the  problem  of  selecting  one  of 
the  t  best  populations.  We  discuss  certain  other  general  ranking  goals 
which  have  been  considered  in  the  literature  and  develop  asymptotically 
optimal  procedures  for  two  additional  ones. 

In  Chapter  4,  we  note  some  density  functions  not  satisfying  the 
regularity  conditions,  and  develop  for  non-regular  exponential  and  uniform 
distributions,  asymptotically  optimal  procedures  for  selecting  the  best 
population  and  for  certain  other  ranking  goals. 

1.3  Ranking  and  Selection  Problems 

Bechhofer  [4]  in  his  pioneering  paper  pointed  out  the  inappropriate¬ 
ness  of  the  traditional  practice  of  testing  null  hypotheses  and  proposed 
for  a  certain  class  of  problems,  termed  ranking  problems,  the  basic  con¬ 
cepts  of  his  multiple  decision  ranking  approach.  These  methods  for  ranking. 
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or  partially  ranking,  a  group  of  populations  on  the  basis  of  an  experi¬ 
ment  are  of  great  practical  importance,  especially  in  connection  with 
the  problem  of  selecting  the  best  populations  from  a  set  of  competing 
ones. 

Since  the  paper  by  Bechhofer,  a  considerable  number  of  papers  have 
been  written  on  ranking  and  selection  problems.  Although  there  are  many 
ways  in  which  such  problems  can  br  formulated,  the  two  most  commorn  formu¬ 
lations  in  the  literature  are  the  "Indifference  Zone"  approach,  as  pro¬ 
posed  by  Bechhofer  [4],  and  the  "Su*  ,et"  approach,  credited  to  Gupta  [20], 

In  this  thesis,  we  concentrate  on  the  indifference-zone  approach  to  rank¬ 
ing  and  selection  problems.  Unless  otherwise  stated,  in  all  references 
to  ranking  and  selection  problems  in  this  thesis,  it  is  understood  that 
we  refer  to  the  indifference- zone  approach  to  the  problem. 

The  general  nature  of  investigations  in  ranking  and  selection  problems 
may  be  summarized  as  follows: 

Let  ni,n2,...,nk  denote  k  populations  (k  >_  2)  being  ranked, 
with  F(»;0t)  denoting  the  distribution  function  of  (t  *  l,2,...,k). 

Here  is  a  vector  of  population  parameters.  The  populations  are 

ranked  on  the  basis  of  a  well  defined  scalar  ^ (0 ^ ) ,  t  *  l,2,...,k. 

Each  0t  may  be  completely  or  partly  unknown,  but  the  functional  form 
{ i>  is  known.  (In  most  problems,  is  one  of  the  parameters  of  the 
distribution.)  The  ordered  ^  are  denoted  by  <_  1  •••  i  • 

It  is  assumed  that  the  pairing  of  the  (t  *  l,2,...,k)  with  the 

^  *  l,2,...,k)  is  completely  unknown.  Let  II ^  denote  the 
population  associated  with 
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It  is  assumed  that  the  larger  the  value  of  the  better  the 
associated  population;  the  population  associated  with  is 

denoted  the  "best"  population.  In  general,  n()<_t«-i)  ^  *  l»2,...,k) 

is  denoted  the  "t^  best"  population.  (Analogously,  in  appropriate 
situations,  we  may  consider  smaller  values  of  ^  as  more  desirable  and 
denote  as  the  "best"  population.) 

Selection  Procedures  (SP)  are  set  up  in  such  a  way  as  to  guarantee 
certain  requirements  on  the  probability  of  "Correct  Selection,"  where  the 
definition  of  correct  selection  depends  on  the  ranking  "Goal"  being  con¬ 
sidered.  Some  of  the  goals  considered  in  the  literature  are 


i)  Select  the  t  best  populations  (1  <_  t  <  k) 

a)  with  regard  to  order 

b)  without  regard  to  order 


ii)  Select  s  of  the  t  best  populations  (1  <_s  <_t  <  k). 

iii)  Select  a  subset  of  s  populations  to  contain  the  t  best 
populations  (1  <_  t  <_  s  <  kj. 

iv)  Select  a  subset  of  s  populations  to  contain  at  least  d  of 
t  best  populations. 


For  a  given  goal,  with  associated  definition  of  correct  selection, 
we  are  interested  in  any  Selection  Procedure  (SP)  which  guarantees  the 
probability  requirement 


(1.1)  inf  P(CS|SP)  >_  P* 

where  u  =  (0 j ,62# •  •  •  *6^) »  G  is  the  set  of  all  possible  w,  and  ^(6*), 
called  the  preference  zone,  is  a  certain  subspace  of  the  parameter  space 
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n,  defined  according  to  the  goal  being  considered  and  a  certain  distance 
function  6 (*,•)•  The  function  fi(b,a)  measures  the  "distance"  between 
two  populations  with  values  of  parameter  of  interest  ip  being  b  and 
a  respectively,  with  a  <_  b.  It  is  assumed  to  be  i)  non-negative 
ii)  equal  to  zero  if  and  only  if  a  *  b,  and  iii)  increasing  in  b  for 
fixed  a  and  decreasing  in  a  for  fixed  b. 

For  example,  for  Goal  i),  6^  »  6^[k  t+1]*  *^[k  t]^  me&sures  the 

distance  between  the  set  of  t  best  populations  (n(k)»‘**,Il(k  t+1)^ 

and  the  remaining  (k-t)  worst  populations;  and  n(6*)  may  be  defined 


as 


(1.2)  n(o*)  ■  {w  c  n 1 6 1  >_  6*} 

If  1,  is  the  location  parameter  cf  nt  (t  *  l,2,..,,k),  then  one 

may  choose  6  *  ip, .  ^  as  a  natural  distance  function  for  the 

t  IK-t+lJ  IX-tJ 

problem;  but  for  other  cases  6(*,*)  may  be  defined  using  practical  and/or 
theoretical  considerations. 

Here  t(6*,  P*)  }  with  6*  >  0,  A (k)  <  P*  <  1  are  constants, 
specified  prior  to  start  of  experimentation.  A(k)  is  a  lower  bound  on 
the  specified  probability  P*,  which  depends  on  the  number  of  populations 
(k)  and  the  goal  being  considered.  It  is  the  probability  which  could  be 
achieved  by  selecting  at  random  and  not  carrying  out  any  experimentation. 
Then  for  a  given  goal  and  distance  function,  and  specified  constants 


(6* ,P* ) ,  a  selection  procedure  SP(5*,P*)  satisfying  (1.1)  is  defined. 
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A  selection  procedure  comprises  a  terminal  decision  rule  and  a  sampling 
rule  (and  associated  stopping  rule  if  sequential  sampling  is  used). 

The  sampling  rule  may  be  single  stage  (fixed  sample  size),  two  or  more 
stages  or  fully  sequential. 

In  the  field  of  ranking  and  selection  problems,  selection  procedures 
have  been  proposed  for  the  particular  problem  at  hand.  In  view  of  the 
above,  a  particular  ranking  problem  is  characterized  by:  i)  the  form 
of  distribution  function  F(*,*)»  ii)  the  scalar  tf/(*)  by  which 
populations  are  ranked  iii)  the  ranking  goal  and  iv)  the  distance 
function  6 (*,*)* 

Thus,  for  example,  Bechhofer  [4  ]  proposed  a  single  stage  procedure 
for  selecting  the  best  normal  population,  when  populations  are  ranked 
according  to  their  means  (with  known  variances),  and  the  difference 
between  the  largest  and  second  largest  mean  is  the  distance  function. 

In  [4],  Bechhofer  also  proposed  single  stage  selection  procedures  for 
certain  other  goals  for  ranking  means  of  normal  populations  with  known 
variances.  Tables  of  sample  sizes  required  to  meet  the  basic  probability 
requirement  (1.1)  were  provided.  These  tables  would  also  be  useful  in 
obtaining  a  large  sample  approximation  to  sample  size  in  using  single 
stage  selection  procedures  for  ranking  parameters  of  certain  other  distri¬ 
butions.  It  is  also  useful  as  an  approximation  for  ranking  variances 
of  normal  populations,  a  problem  for  which  tables  of  exact  sample  size 
for  a  single  stage  selection  procedure  were  provided  by  Bechhofer  and 
Sobel  (  8 ] . 


i 
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Bechhofer,  Dunnett  and  Sobel  [5  )  proposed  a  two  stage  procedure 
for  ranking  means  of  normal  populations  with  a  common  unknown  variance. 

In  the  proposed  procedure,  the  unknown  variance  is  estimated  from 
observations  obtained  in  the  first  stage,  and  an  additional  random  num¬ 
ber  of  observations  are  taken  at  the  second  stage,  the  number  depending 
on  the  outcome  of  the  first  stage  are  determined  so  as  to  guarantee  the 
basic  probability  requirement. 

Paulson  (36]  proposed  a  class  of  sequential  procedures  for  selecting 
the  normal  population  with  the  largest  mean,  the  populations  having  a 
common  variance;  when  the  common  variance  is  known,  the  sequential  pro¬ 
cedure  is  closed.  Paulson  (37]  also  proposed  a  sequential  procedure  for 
selecting  the  best  binomial  population.  Hoel  and  Mazumdar  [26]  have 
extended  Paulson's  open  sequential  procedure  to  solve  the  problem  of 
selecting  the  best  from  the  class  of  Koopman-Darmois  family  of  distributions. 

Bechhofer,  Kiefer  and  Sobel  (  7 ]  have  proposed  sequential  procedures 
(including  generalizations  of  Paulson's  [36]  procedure)  for  ranking  problems 
associated  with  the  Koopman-Darmois  family  of  distributions.  Perng  [38] 
has  recently  compared  the  asymptotic  expected  sample  sizes  of  the  two 
sequential  procedures  of  Paulson  [36]  and  Bechhofer,  Kiefer  and  Sobel  [  7  ] 
for  problem  of  ranking  normal  means  with  common  known  variance.  In  the 
literature,  there  are  no  other  sequential  procedures  proposed  for  the 
ranking  problems  which  satisfy  the  basic  probability  requirement.  Robbins, 
Sobel  and  Stair  [41],  Srivastava  [46]  and  Srivastava  and  Oglivie  [47] 
have  proposed  sequential  procedures,  for  the  problem  of  ranking  means  of 
populations  with  common  unknown  variance,  which  satisfy  the  probability 
requirement  only  asymptotically  (as  6*  0). 
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Since  this  discussion  is  not  intended  to  be  an  extensive  review 
of  the  literature,  we  refer  the  reader  to  the  monograph  by  Bechhofer, 
Kiefer  and  Sobel  [  7]  for  a  comprehensive  bibliography  on  Ranking  and 
Selection  problems  (for  the  indifference  zone  approach  and  other 
approaches  considered  in  the  literature).  Also  see  Ramberg  [39]  for 
some  recent  work  on  certain  ranking  problems  associated  with  multivariate 
normal  populations. 

In  this  thesis,  we  are  only  concerned  with  single  stage  procedures 
and  the  following  discussion  refers  only  to  such  procedures  for  ranking 
and  selection  problems.  In  most  of  the  work  done  in  this  field,  the 
selection  procedures  which  have  been  proposed  initially  were  developed 
more  or  less  on  an  "intuitive"  basis  (so-called  natural  selection  proce¬ 
dures)  to  satisfy  the  basic  probability  requirement  imposed  on  the  proce¬ 
dures.  Bahadur  and  Goodman  [  1  ]  considered  a  class  of  multiple-decision 
rules  which  they  called  impartial  (invariant  under  permutations  of  the 
populations).  Their  results  are  applicable  to  the  problem  of  selecting 
the  best  population  and  imply  that  Bechhofer* s  ([4])  and  Bechhofer  and 
Sobel's  ([8])  natural  selection  procedures  are  minimax  rules  (in  fact, 
uniformly  minimum  risk  rules)  among  the  class  of  impartial  decision  rules. 

Hall  [25]  removed  the  restriction  of  impartiality  and  proved  the 
optimality  of  the  natural  selection  procedures  by  proving  their  minimax 
character  (by  introducing  a  suitable  zero-one  type  loss  function).  Hall's 
results  are  applicable  not  only  for  the  problem  of  selecting  the  best 
population,  but  also  for  the  problem  of  ranking  a  specified  number  of 
populations,  with  or  without  regard  to  order.  Hall's  results  apply  to 


problems  wherein  the  ranking  parameter  is  a  location  or  scale  parameter 
and  for  which  there  is  a  sufficient  statistic  (for  each  sample  size)  with 
a  monotone  likelihood  ratio.  In  situations  where  the  ranking  parameter 
is  not  a  location  or  scale  parameter.  Hall  showed  the  "most  economical" 
character  of  natural  selection  procedures  for  a  specified  location  of  the 
ranking  parameter.  Since  for  many  problems,  a  least  favorable  location 
(of  the  ranking  parameter)  can  be  determined,  Hall  concluded  that  the 
optimal  character  of  the  natural  selection  procedure  can  be  shown  to 
hold  irrespective  of  the  location  of  the  ranking  parameter.  Thus,  Hall's 
result  applies  to  the  problem  of  selecting  the  best  population  when¬ 
ever  there  is  a  sufficient  statistic  with  a  monotone  likelihood  ratio, 
and  therefore,  in  particular,  if  its  distribution  is  in  the  exponential 
family. 

Lehsnnn  [31]  extended  the  results  of  Bahadur  and  Goodman  [  1  ] 
to  show  the  optimality  of  natural  selection  procedures,  among  the  class 
of  impartial  decision  rules,  for  problems  of  ranking  (with  cr  without 
regard  to  order)  populations,  when  the  ranking  parameter  has  a  sufficient 
statistic  with  a  monotone  likelihood  ratio.  Lehmann  [31]  also  showed 
certain  other  optimum  properties  of  the  natural  selection  procedures  and 
provided  an  alternate  proof  to  results  of  Hall  [25],  Eaton  [17]  has  shown 
the  optimality  of  natural  selection  procedures,  among  the  class  of  impartial 
decision  rules,  when  the  ranking  parameter  has  a  sufficient  statistic 
which  has  a  certain  monotonicity  property  (defined  in  [17]  and  similar 
to  the  rankability  condition  in  (  7]).  Eaton's  results  thus  extend  the 
optimality  of  selection  procedures  to  a  larger  class  of  density  functions. 
Fabian  [19]  has  shown  certain  other  optimum  properties  of  natural  selection 
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procedures  for  ranking  problems. 

As  a  consequence  of  the  results  in  [1],  (25],  (31]  and  [17],  we 
note  that  the  natural  selection  procedures  proposed  for  the  following 
problems  are  indeed  optimal  among  the  class  of  all  single-stage  procedures. 
(The  references  noted  in  brackets  indicate  the  papers  in  which  the  proce¬ 
dures  were  proposed). 

(i)  Ranking  means  of  univariate  normal  population  with  common  known 
variance  (Bechhofer  [4]). 

(ii)  Ranking  variances  of  univariate  normal  populations  (Bechhofer 
and  Sobel  [  8 ]) , 

(iii)  Selecting  the  best  of  several  binomial  populations  (Sobel 
and  Kviyett  [44]). 

(iv)  Selecting  the  multinomial  event  which  has  the  highest  proba¬ 
bility  (Bechhofer,  Elmaghraby  and  Morse  [6]). 

(v)  Selecting  the  bivariate  normal  population  with  largest  correla¬ 
tion  coefficient  (Ramberg  (39];  also  given  as  an  example  in 
iiaton  [17])  . 

(vi)  Selecting  the  component  with  the  largest  mean  in  ranking  from 
a  sin.le  multivariate  normal  population  with  common  known 
varia  v.e  and  covariance  of  the  components.  (Given  as  an  example 
in  Eaton  [17]  and  Milton  (34]). 

In  most  of  the  work  done  in  ranking  problems,  populations  are  ranked 
according  to  values  of  a  certain  parameter  in  the  distribution  of  the 
populations.  Other  inknown  parameters  in  the  distribution,  if  any,  would 
constitute  "nuisance"  parameters  for  the  ranking  problem.  In  the  work 
cited  above  on  optimality  of  single  stage  ranking  procedures,  not  much 
explicit  consideration  is  given  to  the  nuisance  parameters.  For  example, 
in  [17],  it  is  assumed  that  a  sufficient  statistic  exists  for  any  unknown 
nuisance  parameters,  and  the  nuisance  parameters  are  such  that  the  basic 
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probability  requirement  is  guaranteed  for  all  possible  values  of  unknown 
nuisance  parameters  by  a  finite  sample  size  single  stage  procedure.  For 
problems,  where  due  to  unknown  nuisance  parameters,  no  finite  sample 
size  single-stage  procedure  satisfying  the  basic  probability  requirement 
exists,  one  may  like  to  develop  single-stage  procedures  which  satisfy  the 
probability  requirement  asymptotically  (as  6*  -*•  0).  No  previous  work 
seems  to  have  been  done  for  such  problems.  The  procedures  developed  in 
this  thesis  are  applicable  for  such  problems.  These  procedures  which 
satisfy  the  probability  requirement  asymptotically,  are  also  asymptotically 
optimal  among  the  class  of  all  decision  rules. 

In  most  of  the  work  on  ranking  problems,  large  sample  approximations 
to  the  single-stage  sample  size  (needed  to  guarantee  the  probability  require¬ 
ment)  are  suggested.  It  would  be  interesting  to  study  the  asymptotic 
properties  of  the  large  sample  approximations  to  the  natural  selection 
procedures.  Asymptotically  optimal  procedures  developed  in  this  thesis 
answer  this  question  and  we  show  the  asymptotically  optimal  character  of 
certain  natural  selection  procedures  which  already  have  been  proposed  in 
the  literature. 

Finally,  in  the  field  of  ranking  and  selection  problems,  there  are 
certain  problems  for  which  no  single  stage  procedures  have  been  proposed 
(for  example  ranking  scale  parameters  of  Weibull  distributions  with  known 
location  parameter  and  common  (known  or  unknown)  shape  parameter). 
Asymptotically  optimal  procedures  developed  in  this  thesis  are  applicable 
to  such  problems  too,  thus  indicating  the  wide  applicability  of  our 


results. 
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1.4  Asymptotically  minimax  tests  of  composite  hypotheses 

For  testing  a  simple  hypothesis  versus  a  simple  alternative,  the 
Neyman- Pearson  Lemma  provides  an  optimal  test.  That  is  among  the  class 
of  all  fixed  sample  size  tests  with  level  of  significance  a  (0  <  a  <  1), 
the  optimal  test  maximizes  the  power.  For  a  certain  class  of  composite 
hypotheses  testing  problems,  Neyman  {35]  obtained  asymptotically  optimal 
tests  in  a  certain  class  of  asymptotically  similar  tests.  Lecam  [29] 
extended  Neyman's  results  to  obtain  asymptotically  optimal  tests  among 
a  larger  class  of  asymptotically  similar  tests.  Bartoo  and  Puri  [3] 
and  Buhler  and  Puri  [11]  have  extended  Neyman's  result  to  slightly  more 
general  setups. 

Very  recently,  Weiss  and  Wolfowitz  (55]  have  obtained  asymptotically 
minimax  (optimal)  procedures  for  a  certain  class  of  composite  hypotheses 
testing  problems.  Using  the  basic  method  developed  by  the  authors 
((52]  and  (53])  in  the  general  theory  of  asymptotically  efficient  esti¬ 
mators,  Weiss  and  Wolfowitz  obtained  asymptotically  optimal  tests  of 
hypotheses  in  the  presence  of  nuisance  parameters.  There  are  no  arbitrary 
restrictions  on  the  class  of  tests  among  which  optimal  tests  are  being 
developed;  hence  the  tests  are  asymptotically  optimal  tests  among  the 
class  of  all  tests.  The  general  theory  is  developed  in  (55]  for  the 
class  of  density  functions  satisfying  some  mild  regularity  conditions; 
but  the  basic  idea  can  be  used  for  the  non-regular  cases  as  well,  each 
such  non-regular  case  requiring  special  analysis. 

The  basic  method  of  analysis  and  the  problem  considered  by  Weiss 
and  Wolfowitz  may  be  summarized  as  follows: 
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Let  Xj,X2»...,Xn  be  independent,  identically  distributed  random 
variables  with  the  common  density  function  f(*;6).  The  density  function 
is  characterized  by  the  vector  parameter  6  «  (0j ,02, . . . *ep»6p+i) »  P  1  1. 

(01,02, ...  ,9p)  are  the  unknown  nuisance  parameters,  and  8  ^  is  the 

parameter  being  tested.  Let  be  the  (null)  hypothesis  that  0^+1  *  0  j 

and  H  be  the  (alternative)  hypothesis  that  0  .  *  (T  ♦  —  ,  where 

P*  P* 

<T  j  is  a  given  constant  and  c  is  a  given  positive  constant. 

Weiss  and  Wolfowitz  [SS]  consider  the  problem  of  testing  the  null 
hypothesis  HQ  versus  the  alternative  hypothesis  H^.  They  obtain  an 
asymptotically  optimal  test.  That  is,  among  the  class  of  all  tests  (of 
Hq  versus  Hj)  which  (is  the  limit  as  n  •*  »)  have  level  of  significance 
less  than  or  equal  to  a  (0  <  a  <  1),  the  optimal  test  maximizes  (in 
the  limit  as  n  -*  »),  the  power  function. 

In  order  to  develop  an  asymptotically  optimal  test,  Weiss  and 
Wolfowitz  first  solve  the  following  sequence  of  artificial  problems 
(one  for  each  n) : 

Hq  and  are  the  two  hypotheses  as  given  above.  The  statistician 
does  not  know  (0j,...,0p)  but  does  know  that 

M  (0) 

(1.3)  |6i  -  ft.  |  ^  -  i  *  1,2,. ...p  and 

/n 


0  ,  «  0  ,  or  0  ,  ♦  — 

p*l  p*l  p*l  ^ 


and  wishes  to  test  versus  Here  Mn(6)  is  a  positive  function 

M  (0) 

of  n  such  that  H  (0)  -*•  ®,  - 

n  /n 


0.  The  above  problem  is  an 
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artificial  problem  since  it  is  assumed  that  the  statistician  knows 


«ve2 . v 

An  asymptotically  optimal  (minimax)  procedure  is  developed  by 

constructing  a  Bayes  decision  rule  for  the  problem  of  testing  hypothesis 

Hq  versus  Hj  (when  the  loss  function  is  zero-one:  the  loss  if  zero 

or  one  according  as  the  correct  or  incorrect  decision  is  made)  relative 

to  the  following  apriori  distribution:  a  total  mass  of  b  is  spread 

M  (6)  _  M  re) 

uniformly  over  the  set  0.  -  -  <  0-  <  0.  ♦  -  i  *  l,2,...,p 

1  -  1  -  1 


and  0  ^  =  0p+j  and  a  total  lass  of  (1-b)  is  spread  uniformly  over 

M  (0)  _  M„(i) 

the  set  IT - <  0.  <  "e.  ♦  - 


/n 

0  .  ■  $  ♦  —  . 
P41  P41 


-  ui  -  ui 


& 


i  *  1 , 2 , . . . , p  and 


Weiss  and  Wolfowitz  (55J  obtained  a  Bayes  decision  rule  for  the 
above  problem  and  studied  the  asymptotic  properties  of  the  decision  rule 
for  a  class  of  density  function  satisfying  certain  mild  regularity  condi¬ 
tions  (stated  in  [55]  and  very  similar  to  the  regularity  conditions 
imposed  for  our  problem  in  Section  2.1).  Since  they  are  interested  in 
asymptotic  behavior  of  the  decision  rule,  the  apriori  mass  b  (0  <  b  <  1) 
is  adjusted  in  such  a  way  that  the  level  of  significance  for  the  artificial 
hypothesis  testing  problem  is  equal  to  a  specified  level  a  (0  <  a  <  1). 
Then,  the  Bayes  decision  rule  for  this  specially  selected  prior  is, 
by  the  very  nature  of  being  a  Bayes  decision  rule,  an  asymptotically 
optimal  procedure  for  the  sequence  of  artificial  problems.  It  is  shown 
in  [55]  that  the  asymptotically  minimax  (optimal)  procedure  is  a  function 
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of  (e'j,^,...,?  ),  which  was  assumed  to  be  known  for  the  sequence  of 
artificial  problems. 

To  obtain  an  asymptotically  minimax  procedure  for  the  real  problem, 
Weii,s  and  Wolfowitz  (55]  propose  using  the  asymptotically  minimax  procedure 
for  the  artificial  problem,  with  (©"^ , 6*2 in  the  decision  variable 
replaced  by  estimators  (0j  (n))  of  the  unknown  nuisance 

parameter.  These  estimators  of  (e|_»®2****  •  ®p)  •  ^ase<*  on  xi*X2*,,,,Xn 
satisfy  the  following  consistency  condition;  for  any  e  >  0,  there  exists 
D(e)  <  ®  such  that 


(1.4) 


P  {/n|e.(n)  -  8.)  <  D(e)  i  «  l,2,...,p}  >  1  -  e 


Hq  or  Hj  is  true 


Because  of  the  above  consistency  condition,  the  proposed  decision 
rule  has  the  same  asymptotic  properties  as  the  asymptotically  optimal 
rule  for  the  artificial  problem,  and  is  hence  asymptotically  optimal  for 
the  real  problem. 

The  basic  method  (outlined  above  without  going  into  any  mathematical 


analysis)  proposed  by  Weiss  and  Wolfowitz  is  very  powerful,  yet  very  simple, 
and  is  adopted  in  this  thesis  to  develop  asymptotically  optimal  ranking 
procedures . 
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1.5  Ranking  problems  in  the  framework  of  statistical  decision  theory  and 
some  additional  remarks: 

In  the  field  of  ranking  and  selection  problems,  Sommerville  [45] 
was  the  first  author  explicitly  to  consider  the  ranking  problem  in  the 
framework  of  statistical  decision  theory.  By  introducing  a  loss  structure 
into  the  ranking  problem,  Sommerville  obtained  a  minimax  decision  rule 
which  balanced  the  cost  of  experimentation  against  the  expected  loss 
associated  with  a  wrong  decision.  However,  Sommerville  considered  only 
prespecified  selection  procedures,  whereas  we  are  concerned  with  develop¬ 
ing  selection  procedures  which  are  in  some  sense  optimal  in  the  class  of 
all  decision  rules.  Bechhofer,  Kiefer  and  Sobel  (p.  46  of  [7])  also 
point  out  how  the  ranking  problem  could  be  reduced  to  a  decision  theory 
problem  by  introducing  a  suitable  loss  function. 

In  the  work  cited  in  Section  1.3  on  showing  the  optimality  of 
natural  selection  procedures,  the  ranking  problem  is  treated  in  the 
framework  static tiv  :i  decision  theory.  The  optimality  of  natural 
selection  procedures  ir.  proved  by  treatir?  the  problem  as  a  roltiple- 
decision  problem  sr.d  shewing  the  minimax  character  of  tiie  procedure, 
by  introducing  a  suitable  loss  function  into  the  problem.  In  this 
thesis  too,  we  trer.t  the  ranking  problem  in  the  f rainow-'u-k  of  ’r.’ald's  [48] 
statistical  decision  theory  as  a  multiple  decision  problem,  wherein  the 
number  of  decisions  depend  on  the  number  of  populations  being  ranked 
and  on  the  ranking  goal  being  considered.  A  zero-one  type  loss  function, 
suitable  for  the  ranking  goal  being  considered,  is  introduced  and  adopt¬ 
ing  the  basic  method  proposed  by  Weiss  and  Wolfowitz  [55]  for  2-decision 
problems  in  the  presence  of  unknown  nuisance  parameters,  we  proceed  to 


develop  an  asymptotically  minimax  procedure  for  the  multiple-decision 
ranking  problem  in  the  presence  of  unknown  nuisance  parameters.  As 
in  [55],  we  first  develop  an  asymptotically  minimax  procedure  for  a 
sequence  of  artificial  problems,  which  in  turn  gives  an  asymptotically 
minimax  procedure  for  the  ranking  problem. 

In  our  analysis,  we  make  no  assumption  about  the  form  of  the  distri¬ 
butions  (except  that  they  satisfy  certain  mild  regularity  conditions, 
given  later  on  in  Section  2.1)  or  the  existence  of  any  sufficient  statistic 
(for  each  sample  size)  for  the  ranking  parameter.  The  problem  is  solved 
for  an  arbitrary,  but  fixed,  location  of  the  ranking  parameter.  If  the 
ranking  parameter  is  a  location  or  scale  parameter  admitting  a  sufficient 
statistic,  then  the  results  hold  irrespective  of  the  specified  location 
of  the  parameter  and  hence  the  procedure  developed  is  an  asymptotically 
optimal  ranking  procedure.  In  other  situations  one  may  be  able  to  find 
"least  favorable  location"  of  the  ranking  parameter  and  thus  the  solution 
to  the  ranking  problem  at  such  a  location  gives  an  asymptotically  optimal 
procedure.  If  in  the  worst  case,  one  cannot  find  such  a  least  favorable 
location,  the  procedure  developed  by  our  method  gives  an  asymptotically 
optimal  identification  procedure,  for  any  arbitrary,  but  fixed,  location 
of  ranking  parameter. 

Since  the  problem  of  selecting  the  best  population  seems  to  be  of 
most  practical  interest,  and  also  because  the  main  ideas  in  the  theoretical 
development  of  asymptotically  optimal  (minimax)  ranking  procedures  are 
illustrated  in  this  case,  we  treat  in  detail,  the  development  of  an 
asymptotically  .ptimal  procedure  for  the  problem  of  selecting  the  best 
population.  For  certain  additional  ranking  goals,  we  develop  asymptotically 
optimal  procedures  by  reducing  the  analysis  to  one  very  similar  to  the 
problem  of  selecting  the  best  population. 


CHAPTER  2 


ASYMPTOTICALLY  OPTIMAL  PROCEDURE  FOR  SELECTING  THE  BEST  POPULATION 

2.1  Notation,  Assumptions  and  Regularity  Conditions 

Let  X^(i  =  1,2,..., n)  denote  observations  from  population 

n  (t  =  l,2,...,k),  and  write  XA  =  (Xli,X2^,...,Xk^).  In  this  thesis, 
we  assume  that  Xj,X2,...,X  are  independent  and  identically  distributed 
random  vectors. 

Let  01,e2,...,ep  denote  unknown  nuisance  parameters,  common  to 
each  of  the  k  populations.  ,  (t  =  l,2,...,k)  denotes  a  scalar 
valued  parameter  of  population  nt>  The  populations  are  ranked  according 
to  the  values  of  the  parameter  ^  (the  large,  che  value  of  ifj,  the  better 
the  population  is  considered  to  be).  For  convenience  in  notation,  let 

(2.1)  '  eo  *  ep+t  1  *  1*2»***'k 

Thus,  eQ  may  be  regarded  as  the  common  value  of  the  ranking 
parameters  and  e  ^  (t  =  l,2,...,k)  may  be  viewed  as  shifts  of 
the  parameter  from  the  common  value  0Q. 

The  density  function  of  the  random  vector  X  is  represented  by 

® \ » ® 2 * *  * ' * ®p* ^p+1 ' ' * *  ’^p+k ^ *  Denoting  (®q»®i»  •  • • *®p+k^ 

3,  the  density  function  will  be  commonly  represented  as  f(x;0). 

The  basic  theory  is  developed  for  a  class  of  density  functions 
satisfying  certain  mild  regularity  conditions,  which  we  state  in  the  form 
of  the  following  assumptions: 
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Assumption_l:  f<xi  6)  exists  and  is  continuous  for  all  x 

a  B  y 

all  0  and  all  a,  B,  y  (a,8,y  x  1, 2, . . . ,p+k) . 

Assumption  2:  The  (p*k)* (p+k)  matrix  whose  (i,j)t^  element  is 

Ee{fo~  log  f(X;8)  log  f (X ; 0 ) }  (i,j  =  l,2,...,p+k) 

i  j 

is  assumed  to  exist  and  is  positive  definite  for  all  9. 

Fix  a  positive  quantity  L  which  will  remain  fixed  throughout  the 
analysis.  For  any  positive  quantity  M,  and  any  0  =  ^eo,0l'*’*'0p+k^’ 
let  R^(e)  denote  the  following  region  in  0-space. 

i  =  1,2, ... ,p 
i  *  p+l,p+2, . . . ,p+k 

Assumption  3:  For  any  F,  there  exists  a  sequence  of  positive 

values  { Mn (0 ) }  with 


(2.2) 


|0i  "  9i>  I  7= 
1  1  ✓n 


1 0 .  -  e. |  <  — 
1  1  ~  ^ 


(2.3) 


lim  M  (6) 

n-K» 

M  (6) 
iim 

n-**>  n 


CO 


0 


and  such  that,  if  {w(n)}  and  { y (n) >  are  any  two  sequences  in  0-space 
with  u>(n)  £  (6)  and  \i  (n)  e  (6)  for  each  n,  then 
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^72 


n 


n 

l 

i  =  l 


l30  30.36 
a  6  y 


log  fixi;e)ju(n) 


converges  stochastically  to  zero  as  n  increases,  when  u,(n)  is  the 
true  parameter  point  for  X.  Also  this  convergence  is  uniform  in 
u(n)  an,’  ui(n)  over  R^J  ^  (?) . 


Assumption  4; 

dx 

0 


*  30.36 

.«  g  y 


f  (x;0) 


3  log  f (x;0) 

Q 


is  uniformly  bounded  for  all  u(n)  e  ^(e)  for  all  a,  g,  y 
(a.B.Y  =  l,2,...,p+k).  n 

For  a,g  *  l,2,...,p*k,  let 


(2.4) 


Bn(a.g;r)  = 


I  l 

n 


n  d2 


log  X. ; 0) 

n  ,L.  36  36,  6  x 

1*1  a  6 


and 


f 


(2.5) 


3(a,8;6)  *  -E 


V\  30  30  . 
6  a  6 


log  f (X;0) 


Assumption  5:  If  ai(n)  is  the  true  parameter  point  for  X, 

w(n)  e  R^  (0”) ,  then  B^a.gjfT)  converges  stochastically  to  Bt&.ajO’) 

and  B(a,g;  (T)  is  a  continuous  function  of  8,  for  all  a, 6  (a, 8  =  1 , 2, . . ,p«V, 
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In  our  analysis,  we  consider  only  those  procedures  which  take 
equal  size  samples  from  each  population.  We  do  not  assume  the  existence 
of  a  sufficient  statistic  for  the  ranking  parameter  or  the  nuisance 
parameters;  and  the  decision  rule  is  given  in  terms  of  the  density 
function  f(x;6). 

Apart  from  the  regularity  conditions  stated  above,  we  make  no  other 
assumptions  about  the  form  of  the  density  functions.  Thus,  the  results 
obtained  in  this  chapter  (and  also  in  Chapter  3)  are  applicable  to  the 
problem  of  ranking  several  univariate  populations  according  to  the 
values  of  a  certain  parameter,  each  population  having  common  unknown 
nuisance  parameters.  The  results  also  apply  to  problems  of  ranking 
several  multivariate  populations,  the  populations  being  ranked  according 
to  values  of  a  scalar  valued  parameter  and  each  population  having  common 
unknown  nuisance  parameters.  The  results  are  also  applicable  to  certain 
ranking  problems  associated  with  a  single  multivariate  population.  The 
results  are  applicable  in  situations  where,  apart  from  the  regularity 
conditions  stated  above,  certain  symmetry  conditions  (on  the  density 
function  f (x ; 0 ) )  hold.  These  symmetry  conditions  are  needed  in  showing 
that  a  symmetric  prior  distribution  gives  a  minimax  decision  rule.  Thus, 
our  results  apply  to  the  problem  of  ranking  from  a  multinomial  distribu¬ 
tion  as  well  as  ranking  means  or  variances  of  a  single  multivariate  normal 
population  with  common  (known  or  unknown)  correlation  coefficient. 

2.2  Some  Preliminary  Results  of  Statistical  Decision  Theory 

In  order  to  put  the  ranking  problems  considered  in  this  thesis  into 
the  decision  theoretic  framework,  and  present  a  general  theoretical 
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development  of  an  optimal  procedure  for  different  types  of  ranking 
problems,  we  define  some  notation  and  present  some  preliminary  decision 
theoretic  results  which  will  be  used  later  on.  The  notation  is  similar 
to  that  used  by  Weiss  [49], 

Let  X,,...,X  be  the  observable  random  variables,  on  the  values 
of  which  the  decision  is  to  be  based.  Let  x  be  an  index  for  the  possible 
sets  of  values  of  (X ^ .X^, . . . .X^) .  Let  f(x;0)  denote  the  joint  pdf 
of  (X  ,...,X  ),  where  6  is  an  index  for  the  possible  joint  distribu¬ 
tions. 

Let  D  be  an  index  for  the  possible  decisions;  that  is,  a  particular 
value  of  D  indicates  a  particular  decision.  In  the  case  where  there 
is  only  a  finite  number  of  decisions,  say  h,  we  can  list  the  decisions 
in  a  particular  order  and  let  Di  (i  *  l,2,...,h)  indicate  the  ith 
decision.  Since  for  ranking  problems,  the  number  of  decisions  is  finite 
(this  total  number  depending  upon  the  number  of  populations  and  the  specific 
goal  being  considered),  this  notation  will  be  used. 

In  the  decision  theory  formulation,  different  ranking  problems  would 
be  analyzed  in  the  same  way,  differing  only  in  the  total  number  of  possible 
decisions  h,  and  interpretation  of  each  decision  (depending  on  the  rank¬ 
ing  goal).  For  example,  for  selecting  the  best  population,  the  total 
number  of  possible  decisions  h  is  equal  to  the  number  of  populations 
k;  and  1).  may  refer  to  selecting  as  the  best  population.  For 

the  problem  of  selecting  a  fixed  size  subset  of  size  s  (s  <•  k)  to  con¬ 
tain  the  best  population,  the  total  number  of  possible  decisions  h  is 

k 

(^ )  and  each  may  refer  to  selecting  a  pm  tirular  subset  of  s  i  ze  s 


as  the  best  subset. 
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Let  W(6,D,x)  denote  the  loss  incurred  when  x  is  the  observed 
value  of  (X1....,Xn).  D  is  the  decision  made  and  6  is  the  true 
parameter  value.  For  a  large  class  of  problems,  W(  )  may  be  indepen¬ 
dent  of  X. 

Definition:  A  decision  rule  s  is  defined  by  nonnegative  numbers 

s(D;x),  where  s(D;x)  is  the  probability  assigned  by  decision  rule  s 
to  choosing  a  decision  D  when  x  is  observed. 

When  the  total  number  of  possible  decisions  is  finite,  say  h,  then 
we  have  for  each  x, 

h 

(2.6)  l  s(D.;x)  «  1 

i-1  1 

Definition:  The  expected  loss,  incurred  when  using  decision  rule  s, 

and  the  true  joint  probability  distribution  is  given  by  6,  is  denoted 
by  r(6;s),  and  often  called  the  risk  function. 

For  a  problem  with  a  finite  number  h  of  possible  decisions  and 
joint  pdf  f(x;6)  (for  random  variables  (Xj ,X^, . . . ,xn)) 

•  »  h 

(2.7)  r(0;s)  *  /  •••  /  [  W(0;D.;x)  f(x;9)  s^jx)  dx1#...,dxn 

-UD  —CO  1  *  1 

Let 

•  max  r(0;s) 

0 


(2.8) 


M(s) 
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Definition:  The  expected  risk  for  a  decision  rule  s,  with  respect  to 

a  cdf  B(6)  for  chance  variable  9,  is  denoted  R (s ;  B(0))  and  given 
by 

(2.9)  R(s ; B(e) )  *  E  { r (6;s) } 

B(0) 

Definition:  A  decision  rule  s  is  a  "Bayes  decision  rule  relative  to 

B(0)"  if  for  every  decision  rule  t, 

(2.10)  R(s;B(6) )  <_  R(t;B(0)) 

Definition:  A  decision  rule  s  is  called  a  minimax  decision  rule,  if 

for  every  decision  rule  t, 

M(s)  <_  M(t ) 

B(0),  used  for  constructing  a  Bayes  decision  rule  is  often  called 
an  "apriori  distribution."  he  would  like  to  point  out  that  0  is  an 
unknown  vector  and  not  a  chance  variable.  The  introduction  of  the  cdf 
B(0)  is  just  a  technical  device  to  enable  one  to  define  a  Bayes  decision 
rule  for  the  case  of  an  infinite  number  of  possible  distributions  (indexed 
by  6). 

From  a  Bayesian  viewpoint,  one  may  specify  some  particular  cdf 
B*(0)  and  construct  a  Bayes  decision  rule  relative  to  the  specific 
apriori  distribution.  In  this  thesis,  however,  we  are  only  interested 
in  minimax  decision  rules,  and  Bayes  decision  rules  are  only  used  as  a 
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* 


technical  device  to  construct  such  minimax  decision  rules. 
If  B (9 )  has  a  pdf  b(0),  then 


(2.11)  R(s;B(6))  *  /  r(0 ; s )b (0)d0 

0 


For  a  problem  with  a  finite  number  h  of  possible  decisions,  and 
B(0)  having  a  pdf  b(0),  we  obtain  using  (2.7)  and  (2.11) 


R(s;B(0)) 

(2.12) 


QO  00 


/  (  /..../ 


h 

l  s(D  ;x)  W(0;D.  ;x)  f(x;0)dx 
i*l  1  1  1 


.dx  )b(e)de 
n 


ao 


a  h 


*  /••••/  (  I 


s(Di;x)  k(D. ;x)]dx1...dxn 


where 


(2.13)  k(D,;x)  -  /  W(0;D.;x)  f(x;0)b(0)d0 

*  0  1 

Using  the  above  representation,  we  easily  see  that  "s  is  a  Bayes 

decision  rule  relative  to  B(0),  if  for  each  x,  s(D;x)  is  set  equal  t 

zero  for  every  D.  for  which  k(D.;x)  is  greater  than  min  (k(l).,x)}. 

1  1  l<i<h  1 

We  end  this  section  by  stating,  without  proof,  two  well-known 
theorems  which  enable  one  to  recognize  a  decision  rule  as  a  minimax 
decision  rule. 

Theorem  2,1.  If  s  is  a  Bayes  decision  rule  relative  to  B(0),  and 
if  r(<T;s)  =  M(s)  for  every  0*  which  is  a  point  of  increase  of  B(6), 


then  s  is  a  minimax  decision  rule. 
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If  the  number  of  possible  distribution  functions  is  finite,  and  if 

we  let  b (8 b(0  )  denote  the  apriori  distribution  for  the  possible 
l  m 

values  (6, ,,..,6  )  of  9,  then  we  obtain  the  following  special  form 

1  D 

of  Theorem  2.1 

Theorem  2.2.  If  s  is  a  Ba/es  decision  rule  relative  to  b(6j),b(9_) , . . 

b(e  )  and  if  r(6;s)  =  m(s)  for  every  e  for  which  b(e)  is  positive, 
m 

then  s  is  a  minimax  decision  rule. 


2,3  Selecting  the  best  population 

In  this  chapter,  we  derive,  in  detail,  the  basic  results  needed  to 
develop  an  asymptotically  optimal  procedure  for  the  problem  of  selecting 
the  best  population,  when  the  density  function  of  observations  satisfies 
the  regularity  conditions  given  in  Section  2.1.  Since  we  are  dealing 
with  the  indifference  zone  approach  to  the  ranking  problem,  we  need  to 
define  the  preference  zone  for  the  problem  at  hand.  As  we  are  interested 
in  developing  a  procedure  (really  a  sequence  of  procedures)  which  is 
asymptotically  optimal,  we  defii.;  the  following  sequence  of  preference 
zones  (one  for  each  n) . 


(2.14)  (6*  (n) )  =*  (w  c 


where  c  >  0,  ip  is  the  parameter  being  ranked,  and 
denote  the  ordered  parameters. 


*[1]  i*[2]  1 
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* 


Also,  for  any  sequence  of  procedures  {T(n)},  let 

(2.15)  P(T)  =  lim  inf  P{CS|T(n)} 

n-*-®  ft  (6*  (n)) 

Then,  throughout  this  thesis,  an  asymptotically  optimal  selection  procedure 
is  defined  as  a  procedure  T  (really  a  sequence  of  procedures  (one  for 
each  n))  such  that  among  the  class  of  all  procedures  T', 

(2.16'»  P  (T)  >_  P(T») 

In  order  to  develop  an  asymptotically  optimal  procedure  (that  is 
a  procedure  for  which  (2.16)  holds),  we  formulate  the  problem  as  a 
decision  theory  problem  with  a  particular  zero-one  type  loss  function, 
and  obtain  an  asymptotically  minimax  decision  rule  for  the  associated 
multiple  decision  problem.  This  is  done  in  detail  in  the  next  section. 

We  would  like  to  point  out  here  that  in  developing  selection  pro¬ 
cedures  (for  ranking  problems)  which  are  asymptotically  optimal,  the 
problem  of  defining  a  suitable  preference  zone  is  solved  in  a  nice  way. 

We  are  interested  in  defining  a  sequence  of  preference  zones  in  such  a 
way  that  the  distance  between  the  best  and  the  second  best  population 
approaches  zero  as  n  -*■  ®.  That  we  require  such  a  sequence  for  the 
asymptotic  theory  is  clear  by  the  fact  that  if,  for  example,  the  distance 
between  best  and  second  best  population  were  some  constant,  then  for  any 
procedure  using  a  consistent  estimator  of  the  ranking  parameter,  P(T) 

(as  given  by  (2.15))  would  be  equal  to  one  (that  is,  one  is  able  to 
select  the  best  population  with  probability  approaching  one  as  n  ®). 
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Thus,  the  preference  zone  as  defined  in  (2.15)  is  appropriate  for  the 
ranking  problem.  The  distance  function  used  in  defining  the  preference 
zone  is  a  "natural"  distance  function  for  the  problem  of  ranking  popula¬ 
tions  according  to  values  of  the  location  parameters.  For  asymptotic 
theory,  this  is  an  appropriate  distance  function  for  ranking  any  para¬ 
meters  of  a  distribution,  a  result  not  too  surprising  in  view  of  the 
fact  (as  we  shall  see  in  the  next  section)  that  the  problem  (at  least 
asymptotically)  reduces  to  one  of  ranking  means  of  normal  populations 
for  which  the  distance  function  used  is  a  natural  distance  function. 

For  asymptotic  theory,  in  the  sequence  of  preference  zones  (as  defined 
by  (2.14))  the  rate  of  convergence  of  the  distance  function  to  zero 
(consequently  the  rate  of  convergence  of  preference  zones  to  the  whole 
parameter  space)  is  l//n.  This  is  directly  related  to  the  normalizing 
constant  (/n  for  the  class  of  problems  being  considered),  for  which 
the  decision  variables  have  a  limiting  distribution.  If  the  rate  of 
convergence  is  too  slow,  the  problem  reduces  to  a  degenerate  case 
while  if  the  rate  of  convergence  is  too  fast,  the  decision  rules  will 
not  be  able  to  distinguish  the  best  population  among  the  set  of  competing 
ones. 

For  i  =  1,2, ...,k  let 


(2.17) 


0 

H. : 
i 

9 

P+l 


0  .  +  C  I’/vi 

P+J 


j  —  1,2, ...  ,k 
i  i 


where  c  >  0  and  0  ^  =  0^+^  all  i,j  (i,j  =  l,2,...,k)  are  known  values. 
0p+j  may  be  taken  to  be  zero,  with  no  loss  in  generality. 
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In  view  of  (2.1),  (2.17)  represents  a  restricted  parameter  configura¬ 
tion,  in  which  if  H.  is  the  true  state  of  nature,  then  n.  is  the  best 
’  l  *  i 

population. 

We  now  discribe  the  basic  method  used  in  developing  asymptotically 
optimal  ranking  procedures.  In  order  to  develop  an  asymptotically 
optimal  procedure  for  the  ranking  problem,  we  develop  an  asymptotically 
optimal  procedure  for  an  associated  identification  problem,  with  eQ, 
the  common  location  of  the  ranking  parameter,  as  the  least  favorable 
location  of  the  ranking  parameter.  We  first  solve  the  problem  for  a 
restricted  parameter  configuration,  given  by  (2.17),  and  then  show  that 
the  procedure  developed  is  minimax  overall  parameter  configurations. 

As  in  [55],  we  first  solve  a  sequence  of  artificial  problems,  the  solu¬ 
tion  to  which  suggests  an  optimal  procedure  for  the  real  ranking  problem. 

2.4  Asymptotically  Optimal  Procedure  for  Zero-one  Type  Loss  Functions 

For  the  problem  of  selecting  the  best  population,  let  denote 
selecting  Ih  (as  given  by  (2.17))  as  the  true  hypothesis  (equivalently 
selecting  rh  as  the  best  population).  The  loss  function  is  given,  for 
i  =  1,2, ...,k,  by 

/ 

0  if  is  the  true  hypothesis 

(2.18)  W(6,D.;x)  =  W(9;D.)  =  l 

1  otherwise 

2.4.1  Preliminary  Sequence  of  Artificial  Problems 

For  the  loss  function  (2.18),  and  with  the  joint  density  functions 
of  the  populations  satisfying  the  regularity  conditions  (of  Section  2.1), 
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we  would  like  to  develop  an  asymptotically  optimal  procedure  for  the 
problem  of  selecting  the  best  population.  Before  proceeding  to  the  real 
problem,  we  first  solve  a  sequence  of  artificial  k-decision  problems 
(one  for  each  n). 


Suppose  it  is  known  that  for  i  =  l,2,...,p,  we  have 

M  (0) 


M_  (0  ) 

e  _  J} —  <  e.  <  9.  ♦  J! 

i  / —  —l—i 


'n 


and  (0  ,  ,0  6  ,)  satisfy  one 

^  P+1*  P+2  p+k 


of  the  k  hypotheses  given  by  (2.17),  (e"^ , . . . ,0^)  are  known  constants 
and  0  <  c  ^  L.  We  wish  to  test  which  one  of  the  k  hypotheses  Hj,H2>...,Hk 
is  the  true  one. 

For  the  above  problem,  which  is  an  artificial  one  because  we  assume 
^,...,0^  are  known,  we  construct  a  Bayes  Decision  Rule  relative  to 
the  following  apriori  distribution:  For  j  =  l,2,...,k  a  total  mass 
of  bj  is  spread  uniformly  over  the  set 


_  M0)  _  Mn(0) 

0.  -  — -  <  0.  <  0.  +  — - 

1  r~  -l-i 


i  =  l,2,...,p  and  IF  is  true 


where  b^  >_  0  for  j  =  l,2,...,k,  and 


ft 

Z  b  =  1 
j=l  ] 


The  prior  distribution  (b^ ,  j  *  l,2,„.,k)  is  arbitrary,  but 
fixed.  Later  we  select  the  prior  in  such  a  way  as  to  obtain  a  minimax 
decision  rule  for  the  problem  at  hand. 
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If  we  let  I),  denote  the  decision  to  select  H.  as  the  true 
l  1 

hypothesis  and  compute  k(U.;x)  for  each  i,  then  it  can  be  seen  that 
a  Bayes  decision  rule  relative  to  the  above  apriori  distribution  is  given 
as  follows: 


Select  H  as  the  true  hypothesis  if 


(2.19) 


J(llj)  i  “  1»2< 

l 


where  for  j,i  ■  1,2,...  ,k 


H  (6)  M 

0,  ♦— -  0  ♦JL_ 


M  (0) 


n  f(X  ;0,H  )de  ...,do 

III  1  1  p 


'  M  (0)  >  M  (0) 

~  .  J2 -  8  — — 

1  ^  p 


(2.20)  Jn(t|j) 


-  V? 


vir 


m  («) 

8  ♦_!! — 

fp  ^ 


MJe) 
o  ~ n — 

p  g 


n  f(X.;9,H.)de  J,...,dft 
i-1  1  J  p 


Merc  f(X.  ;b,H  )  denotes  the  joint  pdf  of  the  observations  when  0  is 
the  parameter  value  and  (0^  ^ , . . .  ,0^^)  are  as  given  by  hypothesis  hh 


for  notational  convenience,  we  let 


n  f(x.;0,H.)do ...... de 

i»l  1  1  f 


....  n  f  (X.  ;»,H.  )d«.  ,...,db 


i*  *  y  i 
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where  the  integration  is  over  the  region  as  given  by  (2.20).  Throughout 
the  thesis,  unless  otherwise  specified,  /*••/  indicates  integration 
over  a  region  as  given  by  the  last  expression  for  the  integral  where 
the  limits  are  specified. 

We  investigate  the  asymptotic  properties  of  J  (i|j).  We  now 
define  certain  notation  used  frequently  later  on.  Let  6  denote 
(0q » » •  •  • , 6  , j , • * * ^p+fc) *  Also  for  a  =  l,2,...,p+k,  let 


(2.21) 


An(a;S>  =  -p  l  1st  log  f CXj ; e) 
/n  1=1  a 


Denote  by  Fn(a,B;0),  a, 8  =  1,2,. ..,p,  the  (a,B)th  element  of 

the  inverse  of  the  (pxp)  matrix  whose  (a,B)th  element  is  Bn(a,B;F), 

where  Bn(a,8;0)  is  as  given  by  (2.4).  This  inverse  exists  with  proba 

bility  approaching  one  as  n  increases. 

For  a, 6  =  1,2,.. .,p,  let  F(a,B;6)  denote  the  (a,B)1  element 

of  the  inverse  of  the  (p*p)  matrix  whose  (a, 6)*^  element  is  B(a,B;fT). 

By  our  assumptions,  this  inverse  exists.  Also  by  our  assumptions,  if 

the  true  parameter  point  for  X  is  oj (n)  e  ,—.(6),  then  F  (a, 6;?) 

Mn(eJ  n 

converges  stochastically  to  F(a,8;0)  as  n  increases. 

Using  the  above  notation,  we  get  by  expanding  around  0*,  for 
0  =12  k 


n  n  p+k 

l  log  f(Xi;e,Hjl)  =  l  log  f(X  ;0)  ♦  l 

i=l  i=l  a=l 


.  p+k  p+k  _ 

7  l  l 

a= 1  8=1 


>/n(0  -0  )A  (a;0) 
a  a  n 

+  Qn(9l'  *  *  *  ^p1  £) 


(2.22) 
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where. 


(2.23) 


.  p+k  p*k  p*k 

Vh . 9pl£)  *  6  l  J,  l,  ‘W'VV'VV 

a=l  6=1  y=l 

n  3 

£  30  36  30  log 

1=1  a  6  Y 


and  in  (2.22)  and  (2.23),  we  have  set 


) 

Vi 

c 

P*3 

/K 

V* 

c 

W  ' 

*  ”/n 

i)  • 

i  lemma 

which 

will  be 

and  u  (n)  e  (F)  . 

We  now  prove  a  lemma  which  will  be  useful  in  studying  asymptotic 
properties  of  Jn(t|j). 


Lemma  2 . 1  For  l  *  1,2, ...,k,  0^(9^ , . . . ,6^ | £ )  converges  stochastically 

to  zero  as  n  -*•  *=  for  all  u(n)  e  (0)  . 

n  6 

Proof :  From  (2.23),  we  see  that 


1  Mn<6) 

Qn(0l****'0pll)  1  F  ~ 5 77 

P*k  P*k  Ptk 

v  33 

l  l  l 

i=l  0=1  y=1 

.  ^  36  30  30  l0g  f  * 

r  n 

1=1  a  B  y 

u(n) 

0  as  n  -*■  «>  by  our  assumption. 
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Thus  Qn(9|,...,9  |i)  converges  stochastically  to  zero  as  n  -*  ■»  for 


311  y(n)  e  Si  (e)(e)* 


Q.E.D. 


Using  (2.22),  we  obtain,  for  I  =  1,2, ...,k. 


n 


n 


n  fex.je.H^)  =  exp{  l  log  f (xi;e,Hft) } 
i=l  i=l 


n  p*k 

exp{  [  log  f (x i ; o’)  ♦  l  */n(ea-eajAn(a;e) 


(2.24) 


i=l 


c=l 


H 


-  y  l  l  Bn(a,B;e) 

a,  6=1 


♦  Qn(ei'e2'***’ep,4)} 


Substituting  from  (2.24)  in  (2.20),  we  obtain,  for  j,e  *  l,2,...,k. 


V‘l» 


p+k 


exp{  l  log  f(X  ;9)  ♦  l  ^n(e  -e  )A  (a;0) 

;  _  l  *  _ i  via  11 


1=1 


a»l 


-  J  ?  I  /^(6  -6  )/^(e  -e  )Bn(a,B;e 


2  LaL.  '"a  a  v  6  “ft'  n’ 
a, 6=1 


B) 


♦  ^n(bl*--*0p|£),dei»--*d6f 


p*k 


exp{  l  log  ffX^e)  ♦  l  /n(0a-6j)An(a;e) 


i  =  l 


a=l 


-  J  ni^(V5a>/5'(VeB,Bn{“>'1:9> 

a, 6=1 


"j 


where  H„(H.)  inside  the  bracket  is  used  to  indicat  that  hypothesis 

*•  J 


H.  (H.)  is  true. 
l  j 


Simplifying  the  above  expression,  we  obtain,  for  £, j 


(2.2S) 


V‘li>  - 

r 


exp 


exp 


-2cAn(p*£;6) 


-2cAn(p>j;6) 


.  p+k  p*k 

7  II  ^(e0-9  )/S(8  -e  )B  (.,8;6) 

a»p*l  6=p*l 

2  I  Z~ 

7  l  l  ^(ea-ea)^(eB-e6)Bn(a,B;e) 

a=p*l  6»p*l 


If  we  denote 

Qn(e1.....op 

denominator 


by  the  above  expression  cor  Jn(£|j)  with 

1 1)  and  ,6p|  j)  being  removed  from  numerator  and 

respectively,  then  we  obtain  the  following  useful  lemma. 


Lemma  2.2.  For  l,j  =  l,2,...,k 

J  U|j)  stochastically 

XTTTTj  * 


p 


i 
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for  all  true  parameter  points  u  (n)  e  R^  ^(e)  • 

Proof :  Using  the  law  of  the  mean  for  integrals,  we  can  write 

Q,U) 

-  - -  JnU|j) 

T(j)  n 

p 

where  Qn (i )  (i  =  j,l)  is  between  the  minimum  and  maximum  values 
taken  by  Qn  (6  ^ .  ,8  ;  i )  in  the  region  of  integration.  Using  Leiuna  2.1, 

it  follows  immediately  that  if  u  (n)  is  the  true  parameter  point  for  X 
n  -  J,'/  i) 

and  u(n)  e  ,-.(8),  then  j  y  converges  stochastically  to  one 
r.  ‘  nv  1 

as  n  -*•  «>. 

Q.E.D. 

As  a  direct  consequence  of  le.:..ia  2.2,  to  study  the  asymptotic 
properties  of  Jn(fc|j),  we  reed  only  stud/  the  asymptotic  properties  of 
If  we  let  =  i/nVe^-6  ),  a  =  1,2, ...,p,  we  get  for 
£»j  =  1 »2, . . .  ,k  t 


* 


-38- 


(2.26) 


8)4  S  '  .;1(9  -0  )4(e  -BJB  (a,6;€) 


exp  -2cAn(p+£;0) 


a,. 


a  a'  v  «  0  ii 


exp  ;  -2cA  (p*j;0) 

L  “  a,B*p*l 


0)  -  i  \l  /n(6  -0  1^(6  -6, 


a  -o' . 0 


H  _ 
3 


M  (0)  M  (?) 
n  n 


-M  (0)  -M  (0) 
nv  ’  nv 


exp 


P 

I  w  [A  (a;0)  -  c(  \  B  (a , y ; 0) 


a=  1 


a*  n 


Y*p*  1 
tp+t 

■  I  0.  VA'"-91®1 

a,8*l 


BnU,p**;0))] 


dw  dw 
1  P 


Mn(0)  Mn(6)  f 


exp 


-M„(0)  -M  (?) 


p*k 

w  [A  (a;6)  -  c(  l  B  (a ,y ;0) 
a=l  “  "  y-p*l  " 

tP'j 

■  1  fl.  w<.W“’S;®) 

a  ,8“1 


Bn(a,p^j;0)]) 


dwr-*dwp 


Since  M^(?)  "*  •*  as  n  ♦  ®,  it  would  be  tempting  to  set  in  the  limits  of 
integration  M^(?)  ■  ®  and  conclude  that  the  resulting  value  of  the  expres¬ 
sion  (2.26)  would  have  the  same  asymptotic  behavior  as  J^(t|j).  However, 
since  the  integrand  is  also  a  function  of  n,  through  Bn(a,B;8),  a  care¬ 
ful  analysis  is  required. 

Before  proceeding  to  that,  denote  by  J^(£|j)  the  value  of  J^U  ') 
if  in  the  limits  of  integration  in  tho  expression  for  J^(fc|j)  ((2.26)), 
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Mn(e)  are  replaced  by  •*>.  By  treating  wa  (a  =  l,2,...,p)  formally 
as  having  a  multivariate  normal  distribution  with  mean  vector  0  and 


covariance  matrix  given  by  II Fr  (a,B;8)  I J ,  we  get  for  i,j  =  1,2,. ..,k. 


(2.27) 


exp  |  -2cAn(p*£;8) 


7  l  I  ^(ea-ea)^(e6-eg)Bn(a,B;e) 

a,8=p*l  H£ 


-2cAn(p>j;0)  -  j  l  l  ^(ei-ea)^(ee-e6)Bn(a,B;0) 
a,8*p*l 


i  F  _  F  *  _  _  _ 

7  l  l  (An(a;0)  -  c(  [  B  (a,y;6)  -  ®  (a ,p*i ; 0) ) )  Fn(a,B;0)- 

a,B  =  l  y  *p* 1 

exp  ^p«-l 

p>k 

MAn(6;8)  -  c (  [  Bn(B,y;e)  -  Bn (B ,p*£ ;0) ) } 

y*p*l 

rr~  P  :  A  _  -  -  - - ~ 

7  I  l  ^  (a;e)  -  c(  l  B  (a,> ;6)  -  B  (a,p*j ;8))  }F(a,6;e). 

1  a , B  =  1  "  y  =p> 1  °  n  n 

exp  ^p*j 


•(An(B;F)  -  c(  l  Bn(0,y;fl)  -  Bn(B.p*j;9)) 

rap*i 

tp+j 


Aftc’-  some  simplification,  we  get 
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(2.28) 


J"(*l3) 


exp 


2cAn(P+j;6)  “  2cA  (p+£;0)  *  2C  l  l  A  (a;6)F (a,6;8) 

0,0*1 


Bn(8,p>£;e)  -  Bn(s,p*j;e)] 


exptc^  (6;2|j)] 


where 


C2Vn(6;£  |  j)  *  ~  5  l  ^(8a-ea)/^(6  -e  )Bn(n,B;6) 

a,8*P^l 


-  J  !  I  ^n(eo-e  )/S(e  -8  )B  («,s;e> 

a,8«p*l 


(2.29) 


H. 

J 


H„ 


♦r  0,  F"(a,B;8)i 

a, 8*1 


p*k  _  _  p«-k 

(  I  B  (a,>;0)-2B  (o,p*j;0))(  J  B  (8,y;6)-2B  (6,p*j;0)> 

Y-p*l  "  "  Y*P*1 


p*k  _  p*k 

<  l  B  (o,Y;e)-2Bn(a,p*£;0)H  £  Bf|(0,Y;0)-2Bii(B,P*£;0) } 

Y«p*l  n  n  Y*p^l 


After  some  simplification,  we  get,  for  l,j  =  1,2, ...,k. 
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(2.30) 

Vn(6;£|j)  = 


1 

2 


p+k  p+k  _  p+k 

l  l  B  (a,0;6)  -  2  [  B  (B,p+j;0)  +  B  (p+j ;p+j ; 6) 

a=p+l  g=p+l  3=p+l 

^P+j  ^P+j  ^P+j 

p+k  p+k  _  p+k 

-l  l  B  (a,g;6)  +  2  [  B  (B,p+£;6)  -  B  (p+£;p+£;0) 

a=p+l  B=p+1  B=p+1 

/p+£  ^p+£  ^p+£ 


♦2  fl  F(«»B;e) 

a,B=l 


{  l  B  (a,Y;0 j ) * ( B  (8,p+£  ;  0)  -  B  (6,p*j;0)} 

Y=p+1  "  ° 


-Bn(i,p+£;0)Bn(B,p+£;0)  +  B^a.p+j  ;6)Bn(0,p+j  ;  0) 


We  now  proceed  to  examine  the  asymptotic  properties  of  J^(£|j).  A  lemma, 
which  will  be  useful  in  determining  the  asymptotic  distribution  of 
J^(£|j)  is  first  proved. 

For  a  =  l,2,...,p+k,  let 


(2.31) 


An(a;0) 


P*k 

A  (a;0)  -  l  /n(w  (n)-0  )B(a,B;0) 
°  6*1  6  6 


Lemma  2.3  If  the  true  parameter  point  for  X  is  w(n),  u> (n)  e  ,^(9), 

then  A  (1;P),A  (2,0),...,An(p+k;y)  Lave  asymptotically  a  joint  normal 
distribution,  with  zero  means  and  covariance  between  ^n(a;0)  and 
An(ti;0)  given  by  B(a,B;e’). 


-42- 


Proof :  If  w(n),  u)(n)  e  ^  (Q ) ,  is  the  true  parameter  point  for 


X,  then  for  a  =  l,2,...,p+k. 


E  \  [A  (a;0)  1  =  E  , 

u(n)1  nv  ’  w(n) 


-p  l  hr'°*  f  CXj ;  e ) 

/n  1=1  a 


6  J 


(2.32) 


=  /n  /  f (x,u>  (n) )  |p  log  f  (x;0) 


dx 


=  */n  J  f (x ; 6  ♦  (u>(n)  -  9))  p-  log  f(x;0) 


dx 


Expanding  f(x;u(n))  around  9  and  denoting,  for  notational  convenience. 


Jq~  log  *(x;0) 


by  p-  log  f(x;0),  we  obtain  for  a  =  l,2,...,p+k, 
8  “ 


WVai9)> 


=  Sn  f  f (x;9)  |p-  log  f(x;9)dx 


P*k  ®, 

♦  [  /n(u>  (n)-9  )  /  f(x;9)  —  log  f(x;9)dx 


6=1 


B  >  39 

-•»  e 


(2.33) 


*  t  ji  <-6 oo-P w»>-%>  J  hr  l0‘  f(x-9)dx 


6,y*1 


By  a 


Thus , 


where  u(n)  e  ^(9)  . 


p>k 


E  =  I  -/n(w  (n)-6  )  J  log  f (x ; 9 )  J—  log  f (x;>>) f (x; y )dx 

w(n)  n  6  =  1  H  6  -»  39a 


3 


*  T  fl  S(">-'V(V">-V  j  3—  1°«f|,-8»dl< 

6,y*1  -■ 


Te  39  39 

by  j 
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or , 


p+k  _  _ 

Eu(n)fAn(o;0)1  =  %  ^(“B(n)-eB)B(o.6;e) 

B  =  1 

(2.34) 

-  b,y-1  p  "  '  1  -«*  ”“B  '"'y  "o 

where  u(n)  e  rJJ  ^ }  (?)  . 

From  (2.34)  and  the  assumptions  made  in  Section  2.1,  it  follows 

that 

E  .  J  A  (a ;  8 ) 
w(n)  n  ’ 

(2.35) 

< 

where  K(6)  is  a  fixed  positive  constant.  Thus  from  (2.35),  it  follows 

that  for  a  =  1, 2, . . .  ,p*-k,  E  ,  ..A  (a:(T)  converges  to  zero. 

1  uj(n)  n  ’  J 

Also  note  that,  for  a,B  =  l,2,...,p+k. 


-  I  /n(uj  (n)-6  )  B (a ,B ;6 )  | 
B-l  8  6  J 

_ 

K  (6  )  — “TJjf 
n  ' 


H  («[■)•»,)(.(")•»)  1-  L(x;;in))  Is-  i°8 


CoVu(n)  lAn^;8)*An(8;0)l  =  Covar^(n)  (An(a;e),An(g;6)] 


E<4n)[Va;9)An(P;e)J  "  E,.,rn  jAn(a;0)  JE  ^  (An(B;6)] 


u-(n) 1  nv  ’  J  w(n) 1  n 


/  jf-  log  f(x;8)  log  f(x;0)  f(x;u>(n))dx 

■“a  6 
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l>»lng  algebraic  stmpl  1 1  uat  ion  a*  lor  L  [A  (u;oj),  anil  following 

win;  n 

the  assumptions  made  in  Section  2.1,  it  can  be  shown  that  the  covariance 
between  A^(a;v)  and  A^(i*;<>)  converge)  to  B(u,t*;o)  for  all 
win;  . 

n 

The  asymptotic  joint  normality  of  A^ ( I ;o) ,A^(2;o) , . . . # A^ ( p • h ; « ) 
is  a  standard  result  used  frequently  in  the  literature  (See,  for  example, 
p.  Soo  of  Cramer  (13)).  Using  that,  the  proof  of  the  lemma  is  hence 


complete. 


Q.1..D. 


Substituting  from  (.'  ll)  in  (2.26),  we  obtain,  for  i,)  •  1,2,... ,k, 


(2.36) 


j;(i|))  • 


f  -  P*h  _  ^ 

exp  /-2cA  (p*t;?)  -  -  i\  /n(6  )/n(<»  -0  )B  (a,?;?)  ) 

)  n  2  M-p.l  a  4  *  “  "  llj 

exp  f-2cAn(p.j;M)  -  y  R  ««.  ;j-0a)/n(0--0ri)Bn(a,e;&)|  ^ 

V.  u,B»p*l  H.  I 


M(*)  M  (C) 
n  n 


P  _  _  P*h 

}  “  (A „ (3 ; H)  ♦  l  /n( 


w  (n)-o^ )B(u,> ;o) 


■«„(*)  -M  (?)  p*k  ,  P 

•c|  l  Bn(a,v;5')-2Bnl  .,p«i;«)))-  y  ll  W3W(iBn(i,ti;«)  ) 
>»p*l  u,P*l 

dw  j , ,, , , 


.  e*p<  \  *-3 (^n ( »;^)  •  l  ^»(w^  (n)-*T  )8(u, ,  ;»•) 

j  j  j«1  »»1  1 

■M  (?)  -M  (?)  p»k  _  .  „p 

*c(  l  Bn(<i,>;o)-2Bn(u,p*j;0)))  -  J  l  l  * 

y»p*l  o,p«l 

dw  j , . . . , dw^ 


Hear ranging  terms,  we  get 


exp 


-2cA  (p*£ 


-  t  l  l  ^(o  -o)^(0„-e. 


n,6sp*l 


-  )B  (a , 6 ; 0) 

a  a  VB  b  n 


H 


£  - 


exp 


-2cAn(p^j 


;0)  -  J  f  I  ^(ea-9a)*/n(Og-0B)Bn(a,6;0) 

u,8«p*l 


H. 


exp 


•J  ^  lw.-tl,,l(n))(“8-t8,«(n))“n'a'B;,i)]  dkl . 


dw 


txp 


•  y  (l,  <Vta,j(n))<Vte,)(n))“n(<'"fi’®)] 

(-i  p  L  *  1 


dw(,...,dwp 


where,  for  a  *  l,2,...,k,  £  ■  l,2,...,k. 


la,i(n) 


(2.38) 


b»  1 


p*k 

A  ( p ; 0 )  ♦  [  /n(wln)4  )B(M;e) 

Y-l 

-c(  '  **  ( 0 , y ; v )  -  B  (o,p**.;0)] 

*-?♦!  n  n 

V1 


and 


( 2 . 3‘J ) 


A  (n) 
1 


1  r^y 

7  /.  i 

u.iul 


i(n)t[,  .  (n)Un(a,t<;tf) 


If  we  let  u  *  w  -t  . (n)  and  v  ■  w  -t  (n),  then  after  some 
i  u  a  , »  u  a  .i,  j 

simplification,  we  obtain,  for  i,j  ■  1,2,... ,k. 
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(2.40) 


du 

P 


dv 

P 

In  order  to  show  that  J^(£.|j)  is  arbitrarily  close  to  Jj|(z|j), 
it  is  sufficient  to  show  that  the  integrals  in  the  numerator  and  denomina¬ 
tor  of  (2.4)  arc,  for  sufficiently  large  n,  with  any  probability  less 
than  one,  within  an  arbitrary  positive  constant  e  of  their  common 
limit  (2n ) p/ 2  | B |  ,  where  | B |  is  the  determinant  of  the  matrix 

B,  whose  (a.B)1^  element  is  l}(a,6;tT).  For  that  it  suffices  to  show 
that  the  limits  of  integration  converge  stochastically  to  (-*>,^)  as 
n  ♦  *>. 

This  will  be  sufficient,  because  then  by  replacing  the  limits  of 
integration  by  (-•',*),  the  effect  on  the  numerator  of  J^(t|j)  is  to 
multiply  it  by  q^U)  where  qn(£)  converges  stochastically  to  one. 
Similarly,  the  effect  in  the  denominator  is  to  multiply  it  by  qnO), 
where  <ln(j)  converges  stochastically  to  one. 


exPl-  \  fl  u„UflBn(a»B:0^dUl1 
1  a, 6=1  B 


.00  -Mn(e'-t  (n) 
n  l  ,*  n  p,£ 


M  (0)-t .  .  (:i)  M  (o')  - 1  .(n) 

nv  1,3V  n)  p,3v 


exP(-  \  fl  vaV:n(a»B;0)ldvr 

cr,6  =  l 


■M  (0 )  —  t ,  .(n)  -M  (  7)  -t  .Fn) 
n  l,]v  n  p,3 • 
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From  (2.38),  we  obtain  for  a  =  1,2, £  =  l,2,...,k. 


l  F  Co.ejti)  A  (3;e)  •  [  /n(u  (n)-«  )B(b,y;9) 
0=1  "  n  Y=1  Y  Y 


-c[  l  B  (0,y ;6)  -  B  (6,p+£;6) ] 

.  11  il 

Y=p+1 


P+k  P 

a  +  l  ^K(n)-0v)  I  [F  (a, 3; 0)B(8,y ;< 
Y=1  Y  Y  6=1 


(2.41) 


\,^n)  -  ‘a , £  +  lml  ^Y(n)-0Y)6n(a,Y) 


where , 


(2.42) 


4n(a,Y)  =  l  k'n(a,6;0)B(0,Y;e) 


and  t^  ^  remains  bounded  with  probability  approaching  one  as  n  -*  *>. 

Since  Fn(a,0;0)  converges  stochastically  to  F(a,8;0)  (a, 6  »  1,2,. ,,p) 
as  n  increases,  it  follows  that 


stoch . 


(2.43) 


0  a  ^  y 


1  a 


Y  -  1,2,... ,p. 


It  also  follows  that  <5n(u,  P*£),  £  =  1,2, ...,k  remains  bounded 

with  probability  approaching  one  as  n  -*  *•. 


(2.49) 
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That  it  is  not  difficult  to  find  such  a  sequence  is  illustrated  by 
the  simple  example  (0 )  =  (1+e  )”*  [Mn C©")  -  M^^(0)].  Using  (2.46) 

and  the  sequence  {L^fe-)}  as  selected  above,  we  get  that  with  probability 
greater  than  1  -  <J>, 

-NIC0-)  -  tQ  .  (n)  <  -MCS-)  +  K(4>)  +  L  (O’)  (1+e  ) 

(2.50)  =  -[Mn(6)  -  (l*en)Ln(e‘)]  *  K (q» ) 

-*•  -*  as  n  -+• 

and, 

(2.51)  M  (0”)  -  t  (n)  >  M  (§”)  -  K (4> )  -  L  (O')  (1+e  ) 

n  v*  i**  n  n  n 

-*■  30  as  n  »  . 

Using  the  above  result  and  lemma  2.2,  we  have  thus  proved  the 
following. 

Lemma  2.4  For  l ,j  =  1,2, ...,k. 

(2.52)  log  Jn(£|j)  =  log  JJ(l|j)  ♦  Zn(£|j) 

where  Zn(fc|j)  converges  stochastically  to  zero  as  n  •+  for  all 

parameter  points  w(n),  w(n)  e  R^1  ( ?) . 

l  (0; 
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* 


As  a  consequence  of  the  above  lemma,  in  order  to  study  asymptotic 
behavior  of  J  (t|j),  we  need  only  study  the  asymptotic  behavior  of 

Using  (2,28)  and  lemma  2.4,  the  Bayes  decision  rule  for  the  problem 
reduces  to  the  following: 

For  i  =  1,2, ...,k#  select  if 


2c(An(p+j;6) 

(2.53) 


-  An(p+£;9))  ♦  2c  H  A  (o;0)F  (a,B;6) 
a,  6=1 


B  (B,p+£;0)  -  B  (B,p+j 


2  b. 

♦  c  vn  (0 ;  £  |  j )  +  2n  (£  |  j )  >_  log  gJ-  j  =  1,2,.. ,,k 


where  vn(0;£|i)  is  given  by  (2.30). 

The  Bayes  decision  rule  divides  the  sample  space  into  k  mutually 
exclusive  and  exhaustive  regions,  each  of  which  a  particular  hypothesis 
is  selected.  This  is  so  irrespective  of  whether  f(';*)  represents  a 
joint  density. function  or  a  joint  probability  mass  function.  Using  the 
regularity  conditions  imposed  on  £(•;•),  we  do  not  have  to  worry  about 
the  ties  introduced  into  the  problem  due  to  equalities  in  relation 
(2.53).  Thus  the  results  obtained  in  this  section  are  not  restricted 
to  assuming  the  existence  of  a  joint  probability  density  function  for 
the  joint  distribution  of  the  random  variables. 

For  ft, j  =  1 ,2, . . . ,k,  let 
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(2.54) 


Wn(£,j)  =  An(P+j;0)  *  An(P+£;6)  +  I  l  Va; 0>Fn(a'6;8)  • 

a,e=l 


I. 


,  iin(3,p+£;6)-Bn(0,p+j;e) 


Rearranging  terms  in  (2,53)  and  using  (2.54),  the  Bayes  decision  rule 
reduces  to  the  following: 

For  i.  =  1,2, select  if 

b.  - 

(2.55)  Wn  (£  |  j )  >_  2c"  ( log  c  V(0  ;£  |  j)  +  Z^(£  |  j) }  j=l,2,...,k, 

i 


where,  Z^(K.|j) 
parameter  point 


converges  stochastically  to  zero  as 
u(n)  e  r£  ^  (0")  .  Also  |  j) 


n  -*•  co,  if  the  true 
is  a  non  random 


continuous  function  of  0  for  all  £ , j  (£ , j  =  1,2,... ,k).  V(0 ;£ | j )  is 

obtained  from  (2.30)  with  the  random  variables  in  (2.30)  replaced  by 
the  constants  to  which  they  converge.  Because  of  the  symmetry  introduced 
in  the  problem  by  redefining  the  parameters  4>  by  (2.1),  we  may  conclude 
for  the  problem  of  ranking  from  several  univariate  or  multivariate  popu¬ 
lations,  when  the  observations  between  populations  are  independent,  that 


(2.56)  V ( 9 ; £  J  j )  =  0  for  all  £,j  ( i,j  =  1,2,.. .,k) 


In  obtaining  the  above  result,  we  have  used  the  following  realtions 


(2.57) 


B(ot,8;0)  =  B(a,y;0)  a  =  l,2,...,p 

B(8,B;e)  =  BtY^je")  B,y  =  p+1  ,p+2, . . .  ,p+k 
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and 

(2..S8)  B(6,y;6)  =  0  6  t  y,  8,y  =  p+1 ,p+2, . . . ,p+k 


If  for  the  problem  of  ranking  from  a  single  mu.' bivariate  problem 
(for  which  (2.57)  holc'.u  but  not  (2,58)),  we  assume  that 


(2.59) 


B(S,y;e)  =  B(5,n;8)  B  t  y,  6  ^  n,  B,y,S,n  =  p+l,p+2, . . ,p+k 


then  (2.56)  still  holds.  From  now  on  we  assume  that  this  is  so  and  hence 
the  results  are  applicable  in  ranking  fiom  a  single  multivariate  popula¬ 
tion  for  cases  for  which  (2,5'))  holds. 

Thus  the  Bayes  decision  rule  may  be  rewritten  as  follows: 


For  H  =  1,2, ...,k,  select  if 

1  b  ■ 

(2.60)  Wn(*|j)  >  (log  jJ-*  (*!))) 


j  =  1,2, ...,k. 


To  study  the  asymptotic  behavior  of  the  Bayes  decision  rule,  we  proceed 
to  study  the  asymptotic  joint  distribution  of  the  random  variables  W  (i|j). 
The  asymptotic  distribution  is  given  by  the  following  theorem. 


Theo-em  2.5 

then  for  fixed 
asymptotically 
given  by 


If  w(n)  is  the  true  parameter  point  for  X,  w(n)  e  r!1  (e), 

V0j 

2  (e  =  1,2, ...JO,  iWn(a|j);  j  =  1,2, ...,k,  j  t  2}  have 

a  joint  normal  distribution  with  mean  and  covariance  matrix 
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p+k 

EW  (£|j)  =  [  •/n(w  (n)-e  )[g(p+j;y;F)  -  g (p+^ ; y; e") ]  j  *  £ 

y-P*l  Y  Y 


and 


Cov(Wn(£| j),Wn(£ji)}  =  s (0 ;£ , i ,  j)  i,j  t  l 

where  g(  )  and  s(  )  are  continuous  known  functions  of  F  (g  ( • )  and 
s(*)  defined  later  on  in  the  proof). 


Proof:  For  each  fixed  £,  are  linear  combinations  of 

An(a;0),  a  =  l,2,...,p+k;  it  fellows  from  Lemma  2.3  and  a  standard 
result  in  literature  on  linear  combination  of  random  variables  with 
asymptotic  joint  multivariate  normal  distribution  (see  for  example 
Rao  [40])  that  for  fixed  £,  { Wn  C ^  I  j )  ;  j  t  £  >  have  an  asymptotic 

joint  normal  distribution.  It  only  remains  to  determine  the  mean  and 
covariance  matrix. 

Rewriting  (£ | j )  in  terms  of  An(a;6),  we  get  from  (2.T1)  and 
(2.54), 


Wn(£|j)  = 

r 

An(p+j;0) 

< 

*  (An(a;0) 

r 

An  (p+£;s") 

- 

•  (A  (a; 9) 

V.  n 


p+k 

+  l  /n(u v(n)-0 v)B(p+j;y;0) 
y=l  Y  Y 

p+k 

+  l  /n(aj  (n)-0  )B(a,y;0)) 
y=l  Y  Y 

p+k 

+  l  /n(u  (n)-0  )B(p+£,Y;0) 

Y  =  1  Y  Y 

p+k 

+  l  ^n(«  (n)-ev)B(o,Y;0)) 
Y=1  Y  Y 


rPr 

l  l  Fn(a,6;0)Bn(B,p+j;0) 
a, 6=1 


l  l  F  (a,B;6)B  (6,p+£;0) 
a, 6=1  "  n 
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or. 


(2.61)  KnU|j)  =  Wn(t|j)  ♦  W^i}  (i  |  j)  ♦  W^2)(*|,) 


where. 


P 

WnU!j)  =  <An(p*j;e)  -  [  l  F  (a,0;0)A  (a;0)B  (0 

a, 6  =  1 

(2.62) 

_P_  _ 

-{An(p+«;6)  -  l  I  An(a;e)Fn(c.,B;e)Bn(B 
a  ,6=1 


(2.63) 

w^Vlj)  = 

p+k  _  _  p 

l  ^(“(n)-0¥ J(B(p>j;y;e)  -  £  [  B(a,y;0)F  (a,B;6)B 

Y3P+1  Y  Y  a, 0=1  "  ’ 

p+k  _  p 

l  /n(u  (n)-0  ){B(p+£,y;e)  -  l  l  B(o,y;e)F  (a,B;6)B 
Y=P+1  Y  Y  a, 6=1 


and 


(2.64) 

w*2)(*|j)  = 


[  B(p+j ,Y ;0) 


l-(B(p+£,y;e) 


fl  B(a,y ;0)F  (o,B;6) B  (0 
a, 8  =  1 


J> 

I  l  B(a,y;0)F  (af0;6)B  (0 
a, 6  =  1 


.r+j;0)) 

,p+£;0) } 

t(B,p*j  ;e~) } 

(6,P+£;e)} 

,p+j;e)] > 

> 

,pt-A;e)  ] 
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From  (2.42)  and  (2.43),  we  see  that  if  u>(n)  is  the  true  parameter 
point  for  X,  oj(n)  e  ^  (0 ) ,  then  for  all  2,y  (£,y  =  1,2,  ...,k), 

k 

(B(p+£,y;8)  -  £  J  B(a,y;9)F  (a,B;8)B  (S,p-*-£ ;  0) )  converges  stochastically 

e«  n  n 

=  1 

to  zero  as  n  increases. 

Clearly,  if  we  choose  a  sequence  {M  (8)}  converging  to  infinity 

P  p 

such  that  l  M  (0) | (B(p+2,y;0)  -  B(a,y;0)F  (a,3;0)B  (S,p+t;0)}| 

y=l  a, B=1 

converges  stochastically  to  zero  as  n  increases,  when  w(n)  is  the 

true  parameter  point  for  X  and  u.(n)  e  ^(e);  then 

n'' 

(2.65)  l/2)(£|j)  Zl°£l I  o 

n  « 

From  now  on  we  aseume  that  the  sequence  (Mn (6) }  (on  which  we  had 
placed  no  restriction  except  as  given  by  (2.3))  is  so  chosen. 

For  £  =  l,2,...,k,  let 

P 

(2.66)  g(p+£,y;e)  --  B(p+2,y;0)  -ll  B(o, y;e)F (a,B; 8)B(S,p+£; 6) 

a,  6  =  1 

y  =  p+l,p+2,...,p+k  . 

Then  from  (2.61),  (2.65)- (2.66) ,  it  follows  th  ’t  when  the  true 
parcmeter  point  for  u(n)  e  rT->(9),  the  asymptotic  joint  distri- 

V J 

bution  of  (£  |  j  )  is  normal  with  mean  of  J  j )  as  required. 


* 

» 
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For  £,j  ■  1(2(. ,.,k,  j  t  £,  let 

(2.67)  Wn(i|j)  -  Rn0)  -  Rn(0 
where,  for  £  -  l,2,...,k, 

p 

(2.68)  R  (£)  -  * n(p*£;0)  -  l  l  £n(a;0)Fn(a,B;0)Bn(8,k*£;0) 

a,6»l 

Then,  in  view  of  results  obtained  above,  asymptotically,  we  get 


Cov(Wn(l|j),Wn(t|i))  -  Cov(i?n(i|j),Wn(£|i)) 

-  E[R*(£)]  ♦  E[Rn(i)Rn(j)J 

-E(Rn(£)Rn(i)]  -  E[Rn(£)Rn(j)) 

Because  of  the  symmetry  in  the  problem,  we  get 

(2.70)  Cov  (IV  (#■  |  j ) ;  V.'  (£  |  i ) )  -  E[R*(£)J  -  E(R  (i)  ;P.  (i)J  . 

n  *i  n  ii  ii 


For  £  «  l,2,...,k,  let 


(2.71) 

and. 


pn(«>P+s-) 


l  F  (a,B;U)B  (B,p+£;'e) 
B»1 


F(a,6;e;D(8,p+£;9) 


(2.72) 


F(a,p+£) 


3=1 
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Then,  using  (2.67),  (2.68), 


Cov(WnU|j),Wn(i|i))  - 

(2.73) 


(2.71)  and  lemma  2.3,  we  get  for 

B(p*t,pn;e)  -  B(pn,p*i;e) 
B(p*t,pn;e)  -  B(p+£,p+j);e 


i  f  J, 


Denoting  the  right  hand  side  of  (2.73)  by  s(6;£,i,j)  completes  the 
proof  of  the  theorem. 

Q.E.D. 

Because  of  the  inherent  symmetry  in  the  structure  of  the  ranking 
problem  and  because  of  the  way  in  which  we  have  redefined  our  parameter 
(Eq.  (2.1)),  we  note  that  the  decision  variables  have  asymptotically  a 
normal  distribution  with  a  common  correlation  coefficient.  This  makes 
it  easier  to  compute  certain  probability  integrals  needed  in  computing 
the  Probability  of  Correct  Selection  and  thus  the  Bayes  risk  for  the 
problem.  Tables  of  integrals  for  equicorrelated  multivariate  normal 
variables  by  Milton  [34]  and  Gupta  [21]  can  be  used  to  obtain  the  Bayes 
risk  for  any  particular  prior  distribution. 

In  order  to  determine  an  asymptotically  miniruax  decision  rule  for 
the  problem  it  follows  immediately  from  Theorem  2.1,  Theorem  2.3  and 
(2.60)  that  because  of  the  symmetry  in  the  problem,  the  Bayes  decision 
rule  given  by  (2.60)  is  an  asymptotically  minimax  decision  rule  if  we 

.  Thus,  an  asymptotically  minimax 
of  artificial  k- decision  problems  is  as 

given  Deiow: 


set,  for  j  *  1,2, ...,k,  b^  » 
decision  rule  for  the  sequence 


* 


< 
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For  £  ■  1,2, ...,k*  select  if 

(2.73a)  W  (i|j)  >  0 


where  Wn(£|j)  is  given  by  (2.54).  The  decision  variables  have  a 
limiting  equicorrelated  multivariate  normal  distribution  with  mean  and 

Ln(e) 

covariance  matrix  as  given  by  Theorem  2.3.  Recalling  that  -  -*•  1 

Mn(?) 

as  n  •*  the  decision  rule  given  by  (2.73a)  is  an  asymptotically 
minimax  decision  rule  for  the  sequence  of  artificial  k-decision  problems. 


2.4.2  Asymptotically  Optima)  Procedure  for  the  "Real"  Problem 

The  above  sequence  of  problems  was  artificial  because  we  assumed 
0JI02* to  be  known.  We  now  proceed  to  the  real  problem,  where 
nothing  is  known  about  the  values  of  the  nuisance  parameters  (0j,...,0p); 
and  with  the  loss  function  as  given  by  (2.18),  we  want  to  develop  an 
optimal  procedure  to  select  the  best  population. 

We  first  define  some  additional  notation.  Let  0J (n) ,e2(n) , . . .  ,ep(n) 
be  estimators  based  on  X  of  0j,...»0p,  such  that  for  any  e  >  0, 
there  exists  D(e)  <  »,  such  that 


(2.74)  P  {/n|0 4(n)-0.|  <  D(c) 

y  2 » •  •  •  9  'p  9  *■  ^ 


i  =  1,2, ...,p)  >  1 


(II  is  true) 


where  {H  is  true)  implies  that  one  of  the  p  hypotheses  given  by 
(2.17)  is  true. 


It  is  worth  noting  hero  that  the  above  condition  on  the  estimators, 
is  only  a  consistency  condition,  certainly  desirable  for  any  estimator. 
Maximum  Probability  Estimators  of  (0j,,..,0  )  could  be  used,  although 
any  set  of  consistent  estimators  satisfying  (2.74)  would  suffice. 

For  notational  convenience,  let 

(2.75)  0(n)  -  (0o,01(n),02(n),...,ep(n),0p+1,...,9p+k) 

Also  define  for  £  ■  1,2,. ..,k, 

P  A 

(2.76)  T*  (a)  =  “IA  (p+£.;0  (n))  -  £  £  F(a,0;0(n))B(0,pH;6  (n) )  • 

n  n  a, 8*1 

*  An(a;0(n))} 

and, 

(2.77)  lnCi)  =  -An(p*£;6(n)) 

Also  define  for  £  +  j  (£,j  *  l,2,...,k) 

(2.78)  &n(l|j)  =  Tn(£)  -  Tn(j)  =  r  (£)  -  T'(j)  . 

Then,  an  asymptotically  optimal  decision  rule  for  the  problem  of 
selecting  the  best  population  is  given  by  the  following  theorem: 
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* 


Theorem  2.4  An  asymptotically  optimal  decision  rule  for  selecting 
the  best  population  is  given  as 

Select  H£  (n£  as  the  best  population)  if 

(2.79)  Tn(0  i  Tn(j)  j  i  l 

J'i  j  =  1,2, ...  ,k  . 

Proof :  It  suffices  to  show  that  the  decision  rule  defined  above  has  the 

same  asymptotic  probability  of  correct  decision  as  the  Bayes  decision  for 
the  artificial  problem,  which  was  also  shown  to  be  asymptotically  minimax. 
It  will  be  sufficient  because  then  the  decision  rule  has,  asymptotically, 
the  same  "risk"  as  a  Bayes  and  minimax  decision  rule  and  hence  is  an 
asymptotically  minimax  (optimal)  decision  rule. 

In  order  to  do  that,  we  obtain  the  asymptotic  distribution  of 
From  (2.78),  we  get 


(2.80) 

- -  p  - - 

A  (p+j ;0+0(n)-9)  -  \\  A  (a; 0+8 (n)-0)F(a,B;6 (n) )B(B,p+j ;0 (n)) 
"  a, 8=1  n 

1 -  P  1 - 

-  (A  (p+£  ;8+6 (n)-0)  -  l  l  A  (a;0+e(n)-e)F(a,e;e(n))B(0,p+a;6(n)} 
n  a, 8=1 

Expanding  around  9,  we  obtain  for  a  =  l,2,...,p+k. 


where  Rn(a)  converges  stochastically  to  zero  as  n  -+  <*  .  Substituting 
from  (2.81)  in  (2,80),  we  get 


(2.82) 
&n(l|j)  = 

An  (p+ j  ;0) 
(An(a;0)  + 

An(p+JL;e) 
(An(a;0)  + 


p+k  *  P 

+  l  t/n(ev-0v(n))B(p+j,Y;6)  -  l  l 
Y*1  Y  Y  a,  8*1 

p+k 

l  •/n(e-ev(n))B(a,Y;e)} 
y=l  Y  Y 


p+k 

l  ^(V6v(n^B(a*Yi6)} 

Y  =  1  Y 


F(a,B;0)B(8,p+j;0)  . 


F(a,B;0)B(0,p+£;0). 


+  Qn(Mj) 

where  (K- 1 j )  converges  stochastically  to  zero  as  n  increases. 


Rearranging  terms,  we  get 


(2.83) 


&n(*l j)  - 

r 

(An(p+j;6) 

/ 


-(An(p+£;0) 


l  l  F(a,0;0)B(8,p+j;e)A  (a; 0) ) 

0,6=1  n 

rPr 

l  I  F(a,0;0)B(0,p+£;e)A  (a;0)) 

a,6=l  n 


(BCP+j^YlQ)  m  ll  F(o,6;0)B(B,p+j;0)B  (a,Y;0)) 
P+k  A  n  a,6=l  " 

*  l  ^(0Y-eY(n))  p 

Y=P+1  -(^(P+t.YJO)  -  I  I  F(a,Bj0)B(0,p+£;0)B  (a,y;0)) 

a, 8=1 


♦  !  4T(.V- 


6  (n)) 


(Bn(P+j,Y;6)  -  l  l  F(a,6;8)B(B,p+j;e)Bn(a,y;0)) 
a, 6=1 


Y  y  p 


-(B  (p+i,Y;6)  -  ll  F(a,6;9)B(6,p+*;0)R  (a,y;0)) 

L  a, 6=1  n  > 


♦  Qn(a|j)  . 


Since  in  (2. S3),  Wn(Jljj)  is  of  the  same  form  as  ( £  j  j )  in 

(2.60)  and  because  of  the  properties  of  estimators  (0 ^ fn),...,0p(n)) 
(i.e.  (2.74))  it  follows  that  5i?n(Jl|j)  has  the  same  asymptotic  normal 
distribution  with  mean  and  covariance  matrix  as  riven  by  Theorem  2.3. 

Since  the  decision  rule  given  by  (2.79)  is  equivalent  to  selecting 
H  if  W  (fc|j)  >  0,  as  in  (2.73),  it  follows  immediately  that  the 
decision  rule  given  by  (2.79)  has  the  same  asymptotic  Bayes  risk  and 


is  hence  an  asymptotically  minimax  procedure  for  the  ranking  problem. 


Q.E.D. 
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In  the  development  of  an  asymptotically  optimal  procedure  for  the 
problem  of  selecting  the  best  population,  we  considered  a  restricted 
parameter  configuration  (as  given  by  (2,17)).  The  general  parameter 
configuration  may  be  represented  as  follows: 


For  l  =  1,2, ...,k, 
r 


(2.84) 


e  „  =  e*  .  -  c/rfT 

p+2  p+£ 


V  < 


8  .  =  F  .  ♦  c.//n  i  =  1,2,..  ,,k 

L  P+J  P+J  3 

/  £ 


where,  c^  ^  c  >  0. 

A  word  of  explanation  is  in  order  here.  In  (2.84),  hypothesis 

implies  that  n  is  the  best  population;  and  the  distance  between  the  best 

c+c . 

population  and  the  other  populations  are  - 1-  ,  (j  =  l,2,...,k,j  +  Jt). 

/n 

For  the  case  where  c.  are  all  equal,  we  get  the  restricted  configuration 
given  by  (2.17)  and  the  case  where  one  or  all  of  c^  are  different,  we 
get  a  general  parameter  configuration. 

Now,  we  must  show  that  the  procedure  given  by  Theorem  2.4  is  indeed 
asymptotically  minimax  over  the  parameter  space.  This  follows  very  easily 
since  for  a  general  parameter  configuration  given  by  (2.80),  the  proba¬ 
bility  of  correct  selection  increase,  whenever  for  any  j,  Cj  >  c.  This 
implies  hence  that  the  configuration  given  by  (2.17)  is  the  least 
favorable  configuration  and  hence  the  procedure  given  by  Theorem  2.4  is 
indeed  asymptotically  miniraax. 


-64- 


2.5  The  Rate  of  Convergence  of  Decision  Variables  to  an  Asymptotic 
Normal  Distribution 

In  our  analysis,  we  have  shown  that  our  decision  rule  (procedure) 
is  asymptotically  optimal.  In  order  to  completely  specify  the  test, 
asymptotic  normality  of  the  decision  variables  is  used.  It  would  be, 
both  from  a  theoretical  and  practical  viewpoint,  very  useful  to  determine 
the  rate  of  convergence  to  the  asymptotic  normal  distribution.  From  a 
practical  viewpoint,  to  an  experimenter,  who  for  large  sample  size  n, 
would  act  as  if  the  asymptotic  distribution  is  the  actual  distribution 
(which  it  is,  to  a  close  approximation)  it  would  be  helpful  to  know  how 
fast  the  asymptotic  results  take  over.  The  results  on  the  rate  of  con¬ 
vergence  would  supply  that  information. 

In  this  section,  we  present  the  main  results  on  the  rate  of  con¬ 
vergence  in  Central  Limit  Theorems  for  a  set  of  independent  random 
variables  (one  dimensional  or  multidimensional).  The  results  on  rate 
of  convergence  for  one  dimensional  random  variables  are  useful  for  the 

problem  of  ranking  two  populations,  which  can  be  easily  shown  to  be  a 

one  parameter  problem.  The  results  for  the  one  dimensional  case  would 
equally  well  apply  to  the  paper  by  Weiss  and  Wolfowitz  [55], 

Rate  of  Co:.vergerce  in  '.im;  dimensional  Cc”.;  ral  Li. nit 'Theorem 

Let  X^X^,..  .  «  be  i)  dependent  and  identically  distributed 

random  variables  with  EX  =  0,  Var  X  =  1  and  E|x|  <  °°.  Denote 
n  x  i  .  2  j  M 

£  X.  by  S  and  /  —  —  e_t  '  dt  by  $(x).  Let 

i=l  1  n  An 
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S 

(2.85)  A  =  sup  P{  —  <  x}  -  4>(x) 

n  x  &  ” 

We  state,  without  proof,  the  following  theorems  (on  the  rate  of  con¬ 
vergence  in  a  central  limit  theorem)  of  Berry  [10],  Esseen  [18]  and  Katz  [28] 
in 

Theorem  2.5  (Berry-Esseen) 

(2.86)  A  <  — 1X13 

n  -  Sn 


where  C  is  an  absolute  constant. 


Theorem  2.6  (Katz) 

(2.87)  A  <  C. 

"  ~  1  g(/5) 

where  is  an  absolute  constant  and  g(x)  is  defined  on  the  real  line 
and  satisfies 

i)  g(x)  >_  0,  even,  non-decreasing  in  [0,°°)  and 
ii)  x/g(x)  is  defined  for  all  x  and  non-decreasing  on  [0,“). 

Theorem  2.6  is  useful  if  one  cannot  make  the  assumption  about 
3 

E | X |  <  00 ,  but  would  be  much  harder  to  apply,  since  the  function  g(x) 

has  to  be  specified.  In  case  one  can  assume  that  E|x|^  <  ®,  Theorem 

2.5  would  be  very  convenient  to  use. 

We  note  that  by  the  Central  Limit  Theorem,  -*•  0  and  from  (2.86), 

-1/2 

An  converges  at  a  rate  of  n  .  The  constant  C  in  Theorem  2.5  is 
given  by  the  following  result  due  to  Zolotarev  [56], 


.JJIPIJII  J.'IPH 


.  rmtmm 


■V 


* 
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Theorem  2tn  (.Zolotarev)  In  Theorem  2.5, 

(2.88)  C  <  1.301 


In  our  analysis,  the  random  variables  {X^,  i  =  l,2,...,nj  would 

be  replaced  by  a  suitable  function  of  the  original  observations,  «i  defined 

by  the  decision  variables  l’nU)  (3ee  (2.78)).  If  besides  the  regularity 

conditions,  we  also  assume  the  existence  of  the  3rd  absolute  moments, 

we  can  use  Theorem  2.5  and  Tvuorem  2.7  directly.  The  rate  of  convergence 

-1/2 

is  of  the  order  of  n  and  it  the  3rd  absolute  moment  of  the  normalized 
decision  variables  is  small  (it  depends  on  the  exact  parametric  form  of 
the  probp.bility  density  function  of  the  population*  being  ranked) 
large  sample  results  will  be  effective  quite  r^pidiy. 


of  r ^.H/prgence  in  Multidimensional  Central  Limit  Theorem 
For  the  general  ranking  problem  for  k  populations  (k  >_  3), 
we  need  multidimensional  analogues  of  the  Berry-Esseen  Theorem. 

Let  Xj,X2»...,X  be  i.i.d.  random  variables  (X.  =  (X^,  X2i-’Xki» 
each  with  distribution  function  F.  Let 


(2.89) 


E(Xti-Ut)2  =  ,2 

E|*ti  *  ^l3  ■  St 
at  *  ~m 

CJ 


for  t  =  1,2, ...,k 


and 
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i'.Iso  let  A  be  the  detem  ^ant  of  the  covariance  matrix  and  A  the 

corresponding  principal  minors  oi  A  (t  *  1,2, Let 

n 

3  --  y  / .  and  let 

n  i 

i=l 


(2,90 ; 


F  (\;  •  P{—  <  X) 

n  /n  - 


where  S  and  x  are  k  dir,<  jsicnal  vectirs.  Let  G(x)  denote  the 
n 

cdf  of  normal  random  variable-?  “iLli  same  first  and  second  moments  as 

that  of  i  (.). 

n'  J 

Using  the  above  notation,  we  have  the  following  theorem  by  Sazanov 
[42]  (stated  without  proof)  on  the  rate  of  convergence. 

Theorem  2,9  (Sazanov) 


(2.91) 


k  A 

sup  |Fn(x)  -  G(x)|  <  C2(k)  (  l  -Hott)n‘1/2 

xcRk  t=1 


As  before,  we  note  that  the  rate  of  convergence  is  of  the  order 

-1/2 

n  ,  and  if  the  constant  terms  in  (2.91)  are  small  enough,  asymptotic 
results  would  be  effective  quite  rapidly.  It  may  be  noted,  that  for 
our  analysis,  since  the  decision  variables  ??n(2.|j)  (those  rate  of 
convergence  is  being  determined)  have  the  same  correlation  coefficient 
and  the  same  variances,  in  (2.91)  we  would  get 


(2.92) 


A  =  A 

tt 


°t  =  ° 


t  *  1,2, ... ,k 


and  hence  a  certain  simplification  would  be  achieved  in  determining  the 
right  hand  side  of  (2.91). 


2.6  Examples 

In  this  chapter,  we  have  developed  asymptotically  optimal  procedures 
for  selecting  the  best  population,  when  the  density  function  of  the 
observations  satisfies  certain  mild  regularity  conditions  (as  given  in 
Section  2.1).  It  is  interesting  to  note  that  for  a  large  class  of  density 
functions  (satisfying  the  above  mentioned  regularity  conditions),  an 
asymptotically  optimal  procedure  for  selecting  the  best  population  (as 
given  by  Theorem  2.4)  takes  such  a  simple  form.  The  populations  are 
ranked  according  to  the  value  of  the  statistic  T^  (as  given  by  (2.77), 
the  population  associated  with  the  largest  value  of  T£  being  selected 
as  the  best  population. 

We  now  give  some  examples  to  illustrate  the  applicability  of  the 
results  to  specific  distributions.  The  first  set  of  examples  indicates 
the  asymptotic  optimality  of  selection  procedures  already  proposed  in  the 
literature.  In  the  second  example  we  give  an  asymptotically  optimal 
procedure  for  selecting  the  normal  population  with  the  largest  mean,  the 
populations  having  common  unknown  variance.  We  also  indicate  how  the 
asymptotically  optimal  procedure  would  be  used  in  practice  for  large 
sample  sizes.  In  the  third  example,  we  develop  an  asymptotically  optimal 
procedure  for  the  problem  of  ranking  Weibull  distributions  according  to 
the  value  of  the  scale  parameter,  when  the  populations  have  known  location 
parameters  and  common  (known  or  unknown)  shape  parameter. 


1.  Asymptotic  Optimality  of  Certain  Selection  Procedures  already  Proposed 

in  the  Literature 

As  a  consequence  of  the  results  in  this  chapter,  the  selection  pro¬ 
cedures  already  proposed  in  the  literature  for  the  following  problems 
are  asymptotically  optimal  (the  references  in  brackets  indicate  the 
papers  in  which  the  procedures  were  proposed): 

(i)  Ranking  variances  of  normal  populations  with  known  or  unknown 
means  (Bechhofer  and  Sobel  [8)). 

(ii)  Selecting  the  best  of  several  binomial  populations  (Sobel 
and  Huyett  [44]). 

(iii)  Selecting  the  multinomial  event  with  the  highest  probability 
(Bechhofer,  Elmaghraby  and  Morse  [6]). 

(iv)  Selecting  the  bivariate  normal  population  with  the  largest 
correlation  coefficient  (Ramberg  [39];  also  given  as  an 
example  in  Eaton  [17]). 

(v)  Selecting  the  component  with  the  largest  mean  in  ranking  from 
a  single  multivariate  population  with  common  known  variance 
and  covariance  of  the  components.  (Given  as  an  example  in 
Eaton  [17]  and  Milton  [34]). 

2.  Ranking  Means  of  Normal  Populations  with  Common  Unknown  Variance 

An  asymptotically  optimal  procedure  for  selecting  the  normal 
population  with  the  largest  mean,  populations  having  a  common  unknown 
variance,  is  to  select  the  population  associated  with  the  largest  sample 
mean. 

The  procedure  is  the  same,  as  one  may  expect,  as  one  proposed  by 
Bechhofer  [4]  for  ranking  means  of  normal  populations  with  common  known 


* 

* 
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variance.  However,  there  is  a  difference  in  the  probability  statements 
one  can  make,  depending  on  whether  the  common  variance  is  known  or 
unknown.  In  case  the  common  variance  is  known,  the  experimenter 
specifies,  prior  to  experimentation,  two  constants  (6*,  P*),  with 
6*  >  0,  ^  <  P*  <  1.  The  sample  size  (n)  needed  to  guarantee  the 
probability  requirement  is  given  as  a  solution  to 


(2.93) 


6*/n 

o 


A(P*) 


2 

where,  o  is  the  common  known  variance  and  A(P*)  is,  as  given  in 
[4],  the  solution  to 


(2.94) 


/  Fk_1(x  ♦  A(P*))f (x)dx  =  P* 


with 


f(x) 


2 


x 


and 


F(x) 


/  f(y)dy 


»oo 


A(P*)  is  tabled  in  Bechhofer  [4]  (also  in  Milton  [ 34 }  and  Gupta  [21]) 
and  thus  one  determines  the  required  sample  size  n(6*,P*)  for  any 
prespecified  constants  (6*,P*). 
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In  case  the  common  variance  (o  )  is  unknown,  for  a  sample  of 

-2 

size  n,  the  experimenter  computes  a  consistent  estimate  a  (n),  say 

2 

the  sample  variance,  of  the  unknown  variance  a  and  acts  as  if 
*2  2 

o  (n)  were  the  actual  variance  o  ,  For  any  sample  size  n  and  the 
a  2 

computed  o  (n),  one  determines  the  pair  A(6*,P*)  satisfying  (2.93). 
Then,  for  a  specified  6*  one  finds  the  probability  (P*)  that  is 
guaranteed,  (or  for  a  given  P*  one  can  find  the  6*  for  which  P* 
is  guaranteed.) 


3,  Ranking  Scale  Parameters  of  Weibull  Distribution  with  Known  Location 
Parameter  and  Common  (known  or  unknown)  Shape  Parameter 


For  a  random  variable  X  having  a  Weibull  distribution,  the  cdf 
G(x;9)  is  given  as 


(2.95) 


G(x;0) 


I-°i  |‘/93' 

*2  I 

/ 


x  <  6 

x  >  0 

~7 


1 

1 


where  0^  is  the  location  parameter,  @2  is  the  scale  parameter  and 
0^  is  the  shape  parameter,  0  =  (©i , ©2* 6^)  and  the  parameter  space 
q  =  {e|-<*>  <  9j  <  »,  >  0,  9^  >  0}. 


In  case  the  location  parameter  0^  is  known,  we  may  take  0j  to 
be  zero  with  no  loss  in  generality  and  get 


(2.96) 


G(x;0) 


r 


0 


1  -  exp 


x  <  0 

x  >  0 
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where  8  *  C6 2 * e 3^  anc*  tbe  Parar,,etflr  space  ft  a{6|02  >  0  ,  >  0}. 

Defining  the  random  variable  Y  as  log  X,  denoting  bY 

D 

e  and  denoting  (B,  8j)  by  8  we  find  that  the  cdf  P(y;6)  of  the 
random  variable  Y  is  given  as 


(2.97) 


F(y;8)  =  1 


exp 


{• exp(^} 


For  the  random  variable  Y  having  cdf  as  given  by  (2.97),  B  is  the 
location  parameter  and  8^  is  the  scale  parameter.  Thus  the  problem 
of  ranking  scale  parameters  of  Weibull  distributions  reduces  to  the 
problem  of  ranking  location  parameters  of  a  distribution  as  given  by 
(2.97). 

The  density  function  f(y;0)  of  observations,  with  cdf  as  given 
by  (2.97),  is  given  as 


(2.98)  f(y;6)  *  exp  (£jp)  exp  (-exp  (£p)) 

3  3  3 

For  the  ranking  problem  at  hand,  with  density  function  of  obser 
tions  given  by  (2.98),  the  regularity  conditions  of  Section  2.1  are 
satisfied.  Thus  the  results  obtained  earlier  in  this  chapter  hold  and 
an  asymptotically  optimal  procedure  is  as  given  by  Theorem  2.4. 

Let  Xti  (i  *  1,2, ...,n)  denote  independent  observations  from 
population  IIt  (t  *  1,2, ...,k)  each  with  cdf  given  by  (2.96).  Let 
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(2.99) 


log  X 


ti 


i  —  1,2,. ..,n 
t  *  1,2, ...,k 


and,  for  £  =  l,2,...,k 

n  l/e 

(2.100)  T  =  l  (X  ) 

i=l  11 

Let  the  ranked  value  of  be  denoted  by 


(2.101) 


f  <  T  <«•!!/  T 

’[l]  I  t[2]  i  i  T[k] 


For  selecting  the  Weibull  population  with  largest  scale  parameter, 
when  the  location  parameters  are  known  and  the  populations  have  a  common 
known  shape  parameter  (e3),  as  asymptotically  optimal  procedure  (as 
given  by  Theorem  2.4)  is  to  select  the  population  associated  with  T^j. 
Let  Zj  <_  <_  •  •  •  •  Z  ^  denote  the  ordered  values  of 

i  =  1,2,..  .,n,  t  =  l,2,...,k).  Then,  as  given  by  Weiss  (50], 

nk+i  nk'1 

(2.102)  03  =  -2^.  ^  {(l-j/nk)log(l-j/nk)}(Z.  +  1  -  Z.) 


is  a  consistent  estimate  of  0^. 
For  £  =  1,2,. ..,k,  let 


1/6j 

r£  *  l  <V 

1*1 


(2.103) 


where  e_  is  as  given  by  (2.102).  Let  the  ordered  values  of  T  be 
denoted  by 


(2.104) 


T  <T  <  ••••  <  T 

[1]  -  [2}  -  -[k] 


For  selecting  the  Weibuil  distribution  with  largest  scale  parameter, 
when  the  location  parameters  are  known  and  the  populations  have  a  common 
unknown  shape  parameter  (6^),  an  asymptotically  optimal  procedure  (as 

A 

given  by  Theorem  2.4)  is  to  select  the  population  associated  with 


i 


i 


CHAPTER  3 


ASYMPTOTICALLY  OPTIMAL  PROCEDURES  FOR  CERTAIN  ADDITIONAL  RANKING  GOALS 


In  the  last  chapter,  we  have  developed  asymptotically  optimal  pro¬ 
cedures  for  selecting  the  best  population,  for  situations  in  which  the 
joint  density  function  of  the  observations  satisfies  certain  mild  regularity 
conditions.  In  this  chapter,  we  extend  the  basic  results  to  develop 
asymptotically  optimal  procedures  for  certain  other  ranking  goals. 

The  following  two  general  ranking  goals  have  been  considered  in  the 
literature: 

(i)  Selecting  a  subset  of  site  s  to  contain  at  least  d  of  the 
t  best  populations,  with  max  (l,s*t+l-k)  ^  d  ^  min  (s,t) 

(which  implies  max(s,t)  <^k-l)). 

d,s,t  are  integers  specified  prior  to  experimentation. 

(ii)  Select  the  kg  "best"  populations,  the  kg  "second  best" 
populations,  the  kg_2  "third  best"  populations,  etc.,  and 
finally  the  kj  "worst"  populations. 

s 

kj»k2,...,ks  (s  <  k)  are  integers  such  that  £  k^  «  k 
and  (kj,k2,...,ks,s)  are  specified  prior  to  experimentation. 

Bechhofer  [4]  in  his  paper  alluded  to  the  ranking  goal  (i),  but  it 
was  formulated  and  formally  considered  by  Mahamunulu  [33]  and  Desu  and 
Sobel  [14],  It  has  been  called  Goal  I  by  Mahamunulu  [33]  and  we  too 
shall  use  that  de_  ignalion.  Two  particular  cases  of  Goal  I  are  of 
special  interest:  (a)  Selection  of  a  subset  of  size  s  (>t)  which 
contains  the  t  best  populations,  and  (b)  Selection  of  a  subset  of 
size  s  (<t)  which  contains  any  s  of  the  t  best  populations.  In 
Section  3.1,  we  consider  a  special  case  of  (a),  with  t  »  1.  We  develop 
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an  asymptotically  optimal  procedure  for  this  special  case  of  Goal  1.  This 
case  is  treated  in  detail,  since  it  brings  out  the  main  steps  needed  in 
extending  the  basic  results  in  Chapter  2  to  develop  asymptotically  optimal 
procedures  for  more  general  ranking  goals.  In  Section  3.2,  we  treat  a 
special  case  of  (b)  with  s  *  1.  We  develop  an  asymptotically  optimal 
procedure  for  this  special  case  of  Goal  I.  In  Section  3.3,  we  give, 
without  proof,  an  asymptotically  optimal  procedure  for  Goal  I.  Bechhofer 
[4]  formulated  goal  (ii)  and  considered  two  cases  in  detail,  namely, 

8  "  2,  .  k-t,  k2  «  t  and  s  -  fl,  kj  -  k-t,  k2  *  k3  =  ...  =  kt+J  =  1. 

For  the  second  case,  we  are  interested  in  selecting  the  t  best  popula¬ 
tions  with  regard  to  order.  Bechhofer  [4)  considered  this  special  case 
and  provided  tables  (to  be  used  to  determine  the  sample  size  needed  for  the 
proposed  single-stage  procedure)  for  the  case  k  *  t  *  3.  This  special 
case  of  goal  (ii)  has  been  formulated  as  Goal  II  in  Bechhofer,  Kiefer  and 
Sobel  [7]  and  we  refer  to  it  as  Goal  II  too.  In  Section  3.4,  we  obtain 
an  asymptotically  optimal  procedure  for  Goal  II.  In  Section  3.5,  we  point 
out  certain  other  possible  goals  in  ranking  and  selection  problems.  The 
method  developed  in  this  thesis  could  be  adapted  to  these  cases. 

3.1  Selecting  a  Fixed-Size  Subset  to  Contain  the  Best  Population 

In  order  to  develop  an  asymptotically  optimal  procedure  for  selecting 
a  fixed-size  subset  to  contain  the  best  population,  we  formulate  the 
problem  in  decision  theoretic  structure.  For  i  *  l,2,...,k,  let 
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(3.1) 


H.  : 

1 


e  •  ♦  c/v'n 
P+J 


p+i 


-  c//n 


j  =  1,2, ...,k 
i  i 


where  c  >  0  and  6  .  =  0  all  i,i  (i,i  =  1,2,. ..,k)  are  known  values. 

p+j  p+i 

o’  j  may  be  taken  to  be  zero  with  no  loss  in  generality. 

In  view  of  (2.1),  (3.1)  represents  a  restricted  parameter  configura¬ 
tion,  in  which  if  is  the  true  state  of  nature,  then  IL  is  the  best 
population. 

We  now  describe  the  basic  method  used  in  developing  an  asymptotically 
optimal  ranking  procedure  for  this  problem.  Tn  order  to  develop  such  a 
procedure  for  the  ranking  problem,  we  develop  an  asymptotically  optimal 
procedure  for  an  associated  identification  problem,  with  e^,  the  common 
location  of  the  ranking  parameter,  as  the  least  favorable  location  of  the 
ranking  parameter.  We  first  solve  the  problem  for  a  restricted  parameter 
configuration,  given  by  (3.1),  and  then  show  that  the  procedure  developed 
is  minimax  over  all  parameter  configurations. 

Let  D(I)  denote  the  decision  to  select  n.,n.  ,..,,n.  as  the 

*1  2  s 

subset  (of  size  s)  containing  the  best  populations,  where  I  =  {  (i^i^, . . ,ig) 
it  i  it.»  t*t'  ■  1,2, ...,s,  i ^ , i t ,  =  l,2,...,k).  I,  the  index  set, 
is  a  s-tuple  whose  components  inidcate  the  s  populations  included  in 


the  selected  subset. 

The  loss  function  for  the  multiple  decision  problem  is  given  as 
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(3.2)  W(0;D(I)) 


0  if  H.  or  H.  or  is  the  true  hypothesis 

12  xs 


1  otherwise 


where  I  *  (i^i^,.. . ,ig) . 

k  k 

There  are  (g)  possible  decisions,  and  we  thus  have  a  (s)-decision 

problem  with  simple  loss  function  given  by  (3.2), 


Preliminary  Sequence  of  Artificial  Problems 


We  now  develop  an  asymptotically  optimal  procedure  for  a  sequence  of 


artificial  (* )-dccision  problems  (one  for  each  n). 


Suppose  it  is  known  that  0^  - 


m  (e) 


<  6.  <  0.  + 

—  l  —  l 


Ve> 


i  =  l,2,...,p  and  (6p+i»6p+2* • • • »ep+k^  satisfy  one  of  the  hypotheses 
given  by  (3.1).  *  *  * »®p  are  known  constants  and  0  <  c  L.  We 
wish  to  select  a  subset  of  size  s,  and  the  loss  function  is  given  by 
(3.2). 

For  the  above  problem,  we  construct  a  Bayes  decision  rule  relative 
to  the  following  prior  distribution: 


For  j  =  l,2,...,k,  a  total  mass  of  bj  is  spread  uniformly  over 


the  set 


M  (9)  _  M  (9) 

(3.3)  6.  -  <  0.  <  0.  +  (i  =  1,2, ...,p) 

/n  *'n 


and  IL  is  true  where;  for  j  =  1,2, ...,k,  we  have  >_  0,  and 
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&b>  *  1  ' 


The  prior  distribution  (b^,  j  =  l,2f...,k)  is  arbitrary,  but  fixed. 
Later  we  select  the  prior  in  such  a  way  as  to  obtain  a  minimax  decision 
rule  for  the  problem  at  hand. 

Before  we  develop  a  Bayes  decision  rule,  we  define  some  notation 
used  later  on.  For  j  =  l,2,...,k,  let 


M  (6)  M  (0) 

¥,♦— -  *  ♦— - 

1  rfT  p  /K 


(3.4)  b.(n)  «  b. 


n  f(X. ;0,n.)de. ,...,de 


M  (8)  _  M  (6)  1 

e.-— —  e  — 

1  a  p  ^ 


V’ 


Also  let. 


(3.5) 


IV 

>  .(n)  *  b  (n)/  [  II  (n) 

J  J  t«i 


Then,  we  may  view  b .  (n)  as  the  posterior  probability  for  the 
hypothesis  ,  if  we  consider  b^  as  the  prior  probability. 

In  terms  of  notation  of  Chapter  2,  for  £,j  *  1,2,. ..,k. 


(3.6) 


b.  b  (n) 

-  E^  E^tf 


Let  the  ranked  values  of  b^  (n)  be  denoted  by 
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(3.7) 


b [  1 1  (°)  -  b[2]^  i  .  -  b[k]fn^ 


In  order  to  develop  a  Bayes  decision  rule  for  this  problem,  we  compute 
k(D(I);x)  for  each  I  and  (in  view  of  the  results  in  Section  2.2)  use 
that  to  construct  a  Bayes  decision  rule. 

For  I  *  (ij.ij,. . . ,i  ) , 


k(D(I);x) 

(3.8) 


k 

l  b .  (n)  -  b.  (n)  -  b.  (n)  -  b.  (n)  -  ...  -  b.  (n) 
j-1  3  *1  *2  *3  *3 

k 

l  b  (n)  -  l  b  (n) 
j«l  3  itcl  xt 


From  (3.8),  it  follows  directly  that  a  Bayes  decision  rule  is  to 
select  the  s  populations  associated  with  the  s  largest  values  of 
bj(n).  In  view  of  (3.7),  a  Bayes  decision  rule  is  to  select  populations 

]»••••  fb^j  (n) ) . 


associated  with 


Using  (3.6),  the  Bayes  decision  rule  can  be  seen  to  reduce  to  the 
following: 

Select  n  ,n.  ,...,n  as  the  subset  containing  the  best  population 
1  X2  ls 


if 


b. 

i  & 

ii 

b. 

(3.9) 

yi2lj> 

>  i . 

-  b 

for  j  i  (i1,i2,...,is) 

e 

• 

Jnasi;> 

*2 

b. 

i  ^ 

j  *  1,2,... ,k 

s 
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The  decision  rule,  as  defined  by  (3.9),  divides  the  sample  space 

k 

into  (  )  mutually  exclusive  and  collectively  exhaustive  regions,  on 
each  of  which  a  distinct  set  of  s  populations  will  be  selected  as  the 
best  subset.  Note,  as  a  check,  that  when  s  =  1,  (3.9)  reduces  to  the 
decision  rule  given  in  Chapter  2  for  the  problem  of  selecting  the  best 
population. 

We  now  obtain  asymptotic  properties  of  the  Bayes  decision  rule 
given  by  (3.9),  Since  the  assumptions  and  regularity  conditions  of 
Section  2.1  hold,  the  asymptotic  properties  of  Jn(i|j),  fc,j  *  1,2, ...,k, 
i  i  j ,  obtained  in  Section  2.4  hold.  Thus  Lemmas  2. 1-2.4  and  Theorem  2.3 
could  be  used  to  reduce  the  Bayes  decision  rule  to  the  following: 

Select  n.  ,n.  ,...,n,  as  the  subset  containing  the  best  population 
*1  *2  *s 

if 


(3.10) 


l  b 

wn(iiij)  i  rc(log  b~  *  z;(iilj}) 

Vi2|j>  i  *  W‘2!j» 

:  12 
:  ,  b 

wn(is|j)  »  irdoggj-.  z;(is|j)) 

s 

J  ^  (i 1**2*  *  *  * 

)  --  1,2 . k 


where,  as  before,  Z'(it|j),  (t  ■  l,2,...,s)  converges  stochastically 
to  zero  as  n  increases,  if  the  true  parameter  point  u(n)  e  ^(9). 
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Also  W  (ij|j)  are  as  defined  by  (2.54)  and  their  asymptotic  distribution 
is  as  given  by  Theorem  2.3.  Thus,  the  decision  variables  have,  asymptoti¬ 
cally,  a  multi-variate  normal  distribution  with  mean  and  variance  given 
by  Theorem  2.3. 

Using  Theorem  2.1  and  the  natural  symmetry  in  the  multi-decision 
problem,  it  follows  directly  that  the  Bayes  decision  rule  given  by  (3.10) 
is  an  asymptotically  minimax  decision  rule  if  we  set  b^  *  for 
j  *  l,2,...,k.  Thus,  an  asymptotically  minimax  decision  rule  is  given 
as: 


Select  n.  , n.  , ,.,,n.  as  the  subset  containing  the  best  population 
X1  l2  ls 


if 


(3.11) 


wn (i j  I  i )  1  0 
wn(i2lj)  i  0 


K„(islj)  i  o 


j  -  1.2 . k 


Recalling  that  as 


Ln(e) 


■*  1,  the  decision  rule  given  by  (3.11) 


is  an  asymptotically  minimax  decision  rule  for  the  sequence  of  artificial 


(^-decision  problems. 


Optimal  Procedure  for  the  "Real"  Problem 

The  above  sequence  of  problems  was  artificial  because  we  assumed 
,82, . . .  ,$p  to  be  known.  Ke  new  proceed  to  the  real  problem,  where 
nothing  is  known  about  the  values  of  the  nuisance  parauictcis  (0^  ,«2> . . . , 0  ) , 
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As  before,  let  d^iO^Cn),...^  (n)  denote  estimators  of 
e1,e2.....0p  respectively,  based  on  the  observations  (Xj,...,Xn).  We 
assume  that  these  estimators  sr-isfy  the  consistency  condition,  as  given 
by  (2,74);  one  may  use  Maximum  Probability  Estimators  (MPE)  of  (0j,...,0p) 
as  the  estimators  (6j(n) , . . . ,0p(n)) . 

For  notational  convenience,  let 


(3.12)  0(n)  *  (0o,01(n),02(n),...,0p(n),0p^1,...,0p>k) 


For  l  «  1,2,.. ,,k,  let 


(3.13) 


T^n)  *  -  An(p*t;0(n)) 


Denote  the  ranked  values  of  T£ (n)  by 


(3.14) 


T^jtn)  I  1(21 (n) *  *  *  *  *  I  Tfk] (”) 


Then,  an  asymptotically  optimal  decision  rule  for  the  problem  of  selecting 
a  subset  of  size  s  to  contain  the  best  population  is  given  by  the 
following  theorem: 


Theorem  3.1.  An  asymptotically  optimal  decision  rule  for  selecting  a 
subset  of  size  s  to  contain  the  best  population  is  to  select  the  s 
populations  associated  with  j  (n)  ,Tjk  (u j , . . .  ,T[k  (n)  ^  ] . 
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* 


Proof:  It  suffices  to  show  that  the  decision  rule  given  by  Theorem  3.1 

has  the  same  asymptotic  properties  as  the  minimax  decision  rule  (given 
by  (3.11))  for  the  sequence  of  artificial  problems.  That  follows  directly 
from  the  proof  of  Theorem  2.4,  completing  the  proof  of  the  present  theorem. 

Q.E.D. 

In  the  development  of  an  asymptotically  optimal  procedure  for  the 
ranking  problem,  we  considered  a  restricted  parameter  configuration  given 
by  (3.1).  The  general  parameter  configuration  may  be  represented  as 
(2.84).  As  in  Chapter  2,  it  follows  very  easily  that  the  parameter 
configuration  given  by  (3.1)  is  the  least  favorable  configuration  and  hence 
the  procedure  given  by  Theorem  3.1  is  indeed  asymptotically  minimax  over 
all  parameter  configurations. 

In  order  to  compute  the  probability  of  correct  selection  for  the 
selection  procedure  given  by  Theorem  3.1,  one  may  use  tables  given  in 
Desu  and  Sobel  [14]  and  Mahamunulu  [32]  for  some  special  cases. 
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where  c  >  0  and  0  .  =  6  all  i,j  (i,j  =  l,2,...,k)  are  known 

p+j  p*i 

values.  Here  may  be  taken  to  be  zero  with  no  loss  in  generality. 

P*J 

In  view  of  (2.1),  (3.14)  represents  a  restricted  parameter  configura¬ 
tion,  in  which  if  H ( I )  is  the  true  state  of  nature,  then  the  t  populations 
for  which  i .  =  1  in  I  are  the  t  best  populations.  For  the  problem 

at  hand,  this  is  the  least  favorable  configuration. 

In  order  to  develop  an  asymptotically  optimal  procedure  for  the 
ranking  problem,  we  develop  an  asymptotically  optimal  procedure  for  an 
associated  identification  problem,  with  0^,  the  common  location  of  the 
ranking  parameter,  as  the  least  favorable  location  of  the  ranking  parameter. 
We  solve  the  problem  for  a  restricted  parameter  configuration  given  by 
(3.14).  Since  (3.14)  represents  the  least  favorable  location,  the  procedure 
developed  is  asymptotically  minimax  over  all  parameter  configuration. 

For  j  *  l,2,...,k,  let  denote  the  decision  to  select  IK 

as  one  of  the  t  best  populations.  There  are  k  possible  decisions  and 

k 

(^)  states  of  nature;  the  simple  loss  function  for  the  problem  is  given 

as 


(3.15) 


W(0.U.) 


s. 


if  i  =  1  in  I 

J 


otherwise 


Preliminary  Sequence  of  A r t j_f icial  Problems 

We  now  develop  an  asymptoti cal ly  optimal  procedure  for  a  sequence 
of  artificial  k-decision  problems  (one  for  each  n). 
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_  M  (F)  _  M  (0) 

Suppose  it  is  known  that  6. - <  0.  <  9.  ♦  -  , 

1  /Z  “  1  '  1  /n 

i  *  1,2, ...,p,  and  (0  j,...,6p^)  satisfy  one  of  the  (t)  hypothesis 

given  by  (3.14).  Here  are  known  constants  and  0  <  c  <_  L, 

We  wish  to  select  one  of  the  t  best  populations,  with  loss  function 
given  by  (3.15). 

For  the  above  problem,  we  construct  a  Bayes  decision  rule  relative 
to  the  following  prior  distribution: 


For  j  *  1,2,...,(*),  a  total  mass  of  b^  is  spread  ui.’formly  over 


the  set 


(3.16) 


M  (0)  M  (0) 

0.  -  —  -  <  8.  <  0.  ♦  -n  ■  i  *  1,2, ...,p 

1  ^  -  1  -  1  4 


and  is  truejwhere  >_  0  for  j  *  1,2, ...,(£), 


I  b.  =  1 

i  =  l  J 


and  H.  is  one  of  the  (^)  hypothesis  given  by  (3.14). 

The  prior  distribution  is  arbitrary,  but  fixed.  Later  we  select 
the  prior  in  such  a  way  as  to  obtain  a  minirax  decision  rule  for  the 
problem  at  hand. 

Before  developing  a  Bayer  decision  rule,  we  define  some  notation 


used  later 


on.  For  I  =  1,2,...,  (*),  let 
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(3.17)  bj (n) 


Mn(0) 

6^—2 - 


'I 


Jm  (e) 

V" 


M  (0) 

e  — 

Pr  /K 


n 

n 


M  (0) 

e  --2 — 
P 


/K 


f(Xi;e,H(i))d61,...,dep 


Let  the  ranked  values  of  b^(n)  be  denoted  by 


(3.18)  bm(n)  <  bm(n)  <  . <  b  .  (n) 

[1]  -  [2]  -  -  [(*)] 

k 

Also  for  j  =  1,2, ...,(),  let 

(t> 

(3.19)  b .  (n)  »  b .  (n) /  [  b  (n) 


thesis 


Then  we  may  view  (n) 
Hj,  if  we  consider  b^ 
Using  the  above  notation. 


as  the  aposteriori  probability  for  hypo- 
as  the  apriori  probability, 
for  j  *  l,2,...,k,  we  obtain 


(3.20)  k(D.;x)  =  £  bj(n)  -  l  bj(n) 

311  1  (I  »  (i1.i2 . 

Thus  a  Bayes  decision  rule  is  to  select  population  IK  (decision  D^) 

as  a  best  population  if  [  bj(n)  is  maximized. 

( I  |ij“l} 

k-1 

Note  that  for  each  population  IK,  j  =  l,2,...,k  there  are  (^  ^) 

terms  in  the  summation  1  b.(n).  Also  there  are  many  common  terms 

alyl)  1 

occurring  in  each  sunmation  being  compared.  We  rewrite  the  Bayes  decision 
rule  (for  convenience  in  analysis)  as  follows: 


* 


-88- 


Select  J!  if 
l 

(3.21)  l  b.  (n)  >  l  b  (n)  j  t  i 

Ulij-1)  (I  |ij*l  t,j  =  1,2, ...,k 

Eliminating  the  common  terms  from  the  above,  the  Bayes  decision  rule 

reduces  to 

Select  n.  if 
l 

(3.22)  l  bj(n)  >  l  bj(n) 

UlV1} 

i j*0  it-0 

k-2 

Now  there  are  exactly  (t_^)  terms  in  each  summation  above.  Because 
of  the  symmetry  in  the  two  summations,  there  is  for  every  term  in  the 
summation  at  the  left,  exactly  one  term  in  the  summation  at  the 
right  which  has  the  same  I  except  for  the  obvious  change  from 
(i^  *  1,  ij  ■  0)  to  (i^  =0,  ij  ■  1).  Thus  we  may  rewrite  the 
Bayes  decision  rule  as: 

Select  n£  as  one  of  the  t  best  populations  if 


j  *  * 

t.j  •  1,2 . k 


(3.23) 


bI(i  .ij")  .0;FP) 

K - TnT -  >  1  j  /  i 

bI(ij*l,  i^OjFP)  *,j  *  1,2,... ,k 


where  FP  is  a  fixed  permutation  of  the  remaining  (k-2)  terms  of  I. 


:ve  now  study  the  asymptotic  properties  of  the  Bayes  decision  rule 
given  by  (3.23).  The  basic  results  obtained  in  Section  2.4  apply  in  this 
case  toe.  The  method  of  analysis  parallels  that  used  in  Section  2.4  and 
hence  will  not  be  repeated  here.  An  asymptotically  minimax  decision  rule 
reduces  to  the  following: 

For  Jl  =  l,2,...,k  select  as  one  of  the  t  best  population 

if 


(3.24) 


wn(A|j)  >  0 


where  (£ | j )  is  as  defined  by  (2.54). 

The  decision  rule  given  by  (3.24)  is  Bayes  decision  rule  for  the 

artificial  k-decision  problem  for  the  symmetric  prior  distribution, 

1  k 

{b.  * -r— ,  j  ■  1,2, . ..,(}},  As  in  Chapter  2,  the  symmetric  prior  gives 
J  (*) 

a  minimax  decision  rule  because  of  the  natural  symmetry  in  the  ranking 
problem  being  considered. 


Optimal  Procedure  for  the  "Real"  Problem 

The  alove  sequence  of  problems  was  artificial  because  we  assumed 
W-*'Fp  were  known.  We  now  proceed  to  the  real  problem  where  nothing 
is  known  about  the  nuisance  parameters  (ei»®2*****0p^ • 

A  A  A 

Let  0j(n),02(n),...,0p(n)  be  estimators  of  0j,02*...»8p  respectively, 
based  on  the  observations  (X1,...,X  ).  We  assume,  as  in  Chapter  2,  that 
these  estimators  satisfy  the  consistency  condition  (2.74).  MPE  could  be 


used,  although  any  set  of  consistent  estimators  would  suffice. 

For  notational  convenience,  let 

(3.2S)  8(n)  *  <VVn),82(n) . «p.2 . Vk> 

Using  (3,13)  and  (3.14),  an  asymptotically  optimal  procedure  for  the 
real  problem  is  given  by  the  following  theorem. 

Theorem  3.2.  Ail  asymptotically  optimal  decision  rule  for  selecting  one 
of  the  t  best  populations  is  to  select  the  population  associated  with 

T(kj(n). 

The  proof  of  the  theorem  parallels  the  proof  of  Theorem  2.4. 

3.3  Asymptotically  Optimal  Procedure  for  Goal  I 

In  Sections  3.1  and  3.2,  we  have  considered  special  cases  of  Goal 
I  and  developed  asymptotically  optimal  procedures  for  the  special  cases. 

In  this  section,  we  obtain  an  asymptotically  optimal  procedure  for  the 
more  general  Goal  I. 

The  analysis  for  this  problem  is  very  similar  to  the  one  outlined 
in  Section  3.2  and  will  not  be  repeated.  The  least  favorable  configuration 
for  this  problem  is  given  by  (3.14). 

Let  I  be  as  defined  earlier  in  Section  3.2  and  let  J  denote 
(j1.j2,...»jk)»  where  for  t  =  l,2,...,k,  jt  is  equal  to  one  or  zero 
and  exactly  s  of  the  j  are  equal  to  one.  Also  let  D(J)  denote 

the  decision  to  select  the  s  populations  for  which  j  =  1  in  J. 
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k 

For  the  ranking  problem  being  considered,  there  are  (^)  states 
k 

of  nature  and  (s)  possible  decisions.  The  loss  function  for  the 
(g) -decision  problem  is  given  as 


(3.26) 


W(6;D(J)) 


if  I  it-it  i<> 

£«1 


otherwise 


when  I  =  (ij,^, ...,i^)  is  the  state  of  nature  and  D(J)  is  the  decision 
made. 

The  loss  function,  given  by  (3.26),  implies  that  the  loss  is  zero 
if  the  selected  subset  of  size  s  contains  at  least  d  of  the  t  best 
populations;  otherwise  the  loss  is  equal  to  one. 

For  the  ranking  problem  being  considered,  with  the  loss  function 
defined  by  (3.26),  using  (3.11)  and  (3.12),  an  asymptotically  optimal 
procedure  is  given  by  the  following  theorem. 


Theorem  3.3.  An  asymptotically  optimal  procedure  for  selecting  a  subset 
of  size  s  to  contain  at  least  d  of  the  t  best  populations  is  to 
select  the  s  populations  associated  with  T[k]  *^[k  1]  s*l]^’ 

The  proof  of  the  theorem  parallels  that  of  Theorem  2.4  (with  added 
notational  complexities)  and  will  not  be  repeated. 


3.4  Asymptotically  Optimal  Procedure  for  Goal  II 

In  order  to  develop  an  asymptotically  optimal  procedure  for  the  rank¬ 
ing  problem,  we  formulate  the  problem  in  a  decision  theoretic  structure. 
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Let  I  =  (ij,^,...,^),  where  for  j  *  1,2,. ..,k,  i_  is  equal  to 


an  integer  between  0  and  t.  Exactly  (k-t)  i^  are  zero  and  of  the 


remaining  exactly  one  ij  is  equal  to  integer  m  (m  =  1,2, ...,t).  Thus 


each  I  identifies  a  particular  partition  of  the  k  population  into  t 
ordered  "best  set"  of  populations  and  a  set  of  unordered  (k-t)  "worst" 


populations,  ("best  set"  here  implies  the  set  of  populations  in  which 


th 


the  best,  second  best,  third  best,  etc.,  and  finally  t  best  population 


ki 

TJ^tTT 


are  identified).  Thus  there  are  /(.-"rvi  distinct  possible  I,  each 


corresponding  to  a  particular  state  of  nature.  (We  are  excluding  from 


our  consideration  any  ties  which  may  occur  for  the  set  of  t  best  popula- 

kl 

tions).  For  any  one  of  the  I,  let 


(3.27)  H(I) :  6p>. 


0  .  ♦  c/^iT 


if  i  .  *  0 
) 


ep-fj  -  (2i.  -  1)  if  i  .  >  0  j  «  1,2 . k 


where  c  >  0  and  0p^  ■  0  ^  (i,j  »  l,2,...,k)  are  known  values. 

may  be  taken  to  be  zero  with  no  loss  in  generality.  Here,  (3.27)  represents 


a  parameter  configuration  in  which  the  best  population  has  a  value  of  the 


2c 


parameter  of  interest  1 1>,  —  units  greater  than  the  second  best  population 

</i\ 


2c 


which  in  turn  is  —  units  greater  than  the  third  best  population,  etc., 

</j\ 

and  finally  the  t*^  best  population  has  a  value  of  the  parameter  ^ 


2c 


—  units  greater  than  the  remaining  (k-t)  worst  populations.  It  follows 


directly  that  this  type  of  parameter  configuration  is  the  LFC  for  the 
problem  at  hand  under  the  indifference  zone  approach  that  we  are  considering 


(See  Bechhofer  [4]  for  definition  of  the  indifference  zone  for  such  a 

problem).  However  the  indifference  zone  approach  to  this  problem  does 

not  restrict  itself  to  considering  least  favorable  con fig'i rat ions  (LFC's) 

in  which  the  i**1  best  and  (i ♦  1 ) s t  best  populations  (i  *  1,2, ...,t)  have 

2c 

the  same  distance  between  them  (*  —  in  our  notation).  The  more  general 

c.  ^ 

setup  with  4/^  ^  ^  ^  (i  «  0,1,2, ...,t-l)  could  be  con¬ 

sidered  too;  but  since  that  only  introduces  more  notational  complexities 
in  the  problem  without  adding  anything  new,  we  will  only  consider  the 
LFC  as  given  by  (3.27)  (with  the  understanding  that  suitable  changes 
would  be  made  in  computing  the  probability  of  correct  selection  when 
using  the  procedure  for  a  more  general  setup.  The  procedure  given  below 
is  optimal  even  for  the  more  general  setup.) 

Let  J  »  where  for  a  ■  l,2,...,k,  ja  is  equal 

to  an  integer  between  0  and  t.  Exactly  (k-t )  j  are  zero  and  of 

fl 

the  remaining,  exactly  one  j  is  equal  to  the  integer  m  (m  *  l,2,...,t), 

Let  D(J)  denote  the  decision  to  select  n.  (a  «  l,2,...,k)  as 

^a 

st 

the  (t-j  *1)  best  population  (for  j  >0).  The  remaining  (k-t) 

d  a 

populations  are  identified  as  the  set  of  worst  populations.  Since  there 
k  I  k  I 

are  distinct  J's,  we  get  ^  ^  |  distinct  possible  decisions. 

With  the  above  notation,  the  loss  function  for  this  problem  is 
given  as  follows: 

For  any  particular  decision  D(J)  and  state  of  nature  I, 


(3.28) 


W(6;D(J)) 


0  if  J  •  I 


1  otherwise 


The  simple  loss  function  (3.28)  implies  that  the  loss  is  zero  when 
the  right  set  of  t  populations  are  identified  as  the  best  set;  and 
the  loss  is  one  otherwise-. 

VI 

Thus  we  have  a  multiple  decision  problem  with  possible 

kl 

decisions  and  ^  possible  states  of  nature  (exactly  one  decision 

is  correct  for  each  state  of  nature).  This  problem  is  very  similar  to 
the  problem  of  selecting  the  best  population  (k  possible  decisions  and 
k  states  of  nature)  considered  in  Chapter  2.  The  analysis  involved 
in  developing  an  asymptotically  optimal  procedure  for  this  problem  is 
a  complete  repetition  of  the  analysis  presented  in  Section  2.4,  with 
obvious  modifications  introduced  by  using  (3.27)  instead  of  (2.17)  for 
the  least  favorable  configuration  for  the  ranking  problem.  We  thus  do 
not  repeat  the  analysis  and  simply  given  an  asymptotically  optimal  pro¬ 
cedure  for  the  problem. 

Using  (3.11)  and  (3.12),  an  asymptotically  optimal  procedure  is  given 
by  the  following  theorem. 

Theorem  3.4  An  asymptotically  optimal  procedure  for  selecting  the  t 
best  populations  with  regard  to  order  is  to  select  the  populations  associated 
with 

and  the  t**1  best  population  respectively. 

3.S  Certain  Other  Possible  Goals  in  Ranking  and  Selection  Problems 

The  ranking  goals  considered  thus  far  include  almost  all  the  goals 
considered  in  the  literature  on  ranking  and  selection  problems  (using  the 
indifference  tone  approach).  As  pointed  out  earlier,  it  is  assumed  that 


(n),  •..,Tjjt^|jj  as  the  best,  the  second  best,  etc. 
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the  common  functional  form  of  the  density  function  of  observations  from 
each  of  the  populations  is  known.  The  populations  are  ranked  according 
to  values  of  a  parameter  of  interest  There  may  or  may  not  be  other 
unknown  parameters  in  the  distribution,  which,  when  present,  would  be 
classified  as  nuisance  parameters  for  the  ranking  problem. 

In  actual  practical  applications,  there  may  be  situations  where  the 
criterion  for  "goodness"  of  populations  is  more  complicated  and  must  be 
properly  interpreted  before  a  ranking  procedure  is  developed.  For 
example,  in  many  engineering  applications,  the  quality  of  manufactured 
goods  is  characterized  by  the  product  meeting  some  fixed  specifications. 
Thus,  for  example,  one  may  be  interested  in  selecting  a  manufacturing 
process  which  has  the  highest  probability  of  coverage  of  the  specification 
interval.  For  this  problem  "Converge  Probability"  is  the  criterion  for 
"goodness"  of  populations.  Guttman  (22]  and  Guttman  and  Milton  [23] 
consider  this  problem  and  have  developed  procedures  for  selecting  a 
(random)  subset  to  contain  the  best  population  for  normal  and  exponential 
density  functions,  when  a  one  sided  fixed  tolerance  region  is  the 
criterion  for  "goodness". 

In  the  example  cited  above  and  other  related  ranking  problems,  the 
criterion  for  "goodness"  of  populations  may  be  specified,  but  ranking 
procedures  can  be  developed  only  after  the  criterion  is  translated  into 
a  goal  involving  the  parameters  of  the  population.  Two  special  cases 
of  interest  may  arise.  First,  it  may  be  possible  to  redefine  the 
parameters  of  the  populations  in  such  a  way  that  the  "criterion"  is 
translated  into  a  parameter  of  interest  *  and  there  may  or  may  not  be  any 
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* 


nuisance  parameters.  If  the  redefined  parameters  satisfy  the  assumptions 
and  regularity  conditions  of  Section  2.1,  then  the  entire  analysis  carried 
out  earlier  in  this  thesis  would  be  applicable.  If  the  regularity  condi¬ 
tions  are  not  satisfied,  one  may  proceed  to  develop  asymptotically  optimal 
procedure  as  outlined  in  Chapter  4  for  non-regular  cases.  Secondly,  if 
after  redefining  the  parameters,  the  nuisance  parameters  are  functions 
of  the  parameter  being  ranked,  then  special  analysis  would  be  needed  for 
each  particular  case.  To  our  knowledge,  the  only  paper  in  the  literature 
dealing  with  such  a  case  is  by  Chambers  and  Jarratt  [12],  which  deals 
with  the  problem  of  ranking  means  of  populations  when  the  variances  are 
a  known  function  of  the  means  being  ranked. 

We  do  not  propose  to  consider  problems  falling  into  this  second 
category  in  this  thesis.  However,  it  should  be  pointed  out  that  the 
general  method  developed  would  be  applicable  to  this  class  of  problems, 
separate  analysis  being  required  for  each  specific  case  (each  specific 
case  being  characterized  by  a  known  functional  relation  between  the 
nuisance  parameters  and  the  parameter  being  ranked). 

In  our  analysis  we  have  assumed  that  the  correct  pairing  of  the 
populations  and  the  ranked  parameters  are  unknown,  but  the  values  of  the 
parameters  of  interest  are  assumed  to  be  unknown.  Dunnett  [16]  and 
Guttman  and  Tiao  [24]  deal  with  the  normal  means  problem  when  prior  informa¬ 
tion  is  available  about  the  possible  values  of  parameters  ^  (i  ■  l,2,..,k) 
and/or  the  correct  pairings  of  the  populations  and  ranked  parameters.  The 
ranking  goals  considered  are  the  same  as  the  ones  considered  by  us. 
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Although  we  do  not  concern  ourselves  with  such  Bayesian  analyses  of  the 
ranking  problem,  we  should  mention  that  the  procedures  that  we  have 
developed  are  also  asymptotically  optimal  from  a  Bayesian  viewpoint; 
that  is*  when  the  apriori  information  is  available  about  the  possible 
pairings  of  populations  and  the  ranked  parameters.  This  is  so  since 
asymptotically  minimax  (optimal)  ranking  procedures  were  constructed 
from  Bayes  decision  rules  for  suitably  defined  multiple  decision  problems 
by  suitable  choice  of  prior  distribution. 

Lastly  we  would  like  to  point  out  that,  in  our  analysis,  we  have 
assumed  that  we  know  the  functional  form  of  the  joint  density  function 
of  observations  from  the  set  of  populations.  This  was  used  explicitly 
in  constructing  asymptotically  optimal  decision  rule.  Hence,  we  are 
not  concerned  with  a  class  of  nonparametric  ranking  problems,  in  which 
the  form  of  the  joint  density  function  is  not  known.  Different  type 
of  analysis  would  be  needed  for  such  ranking  problems. 


1 


CHAPTER  4 


* 


RANKING  PROBLEMS  IN  NON-REGULAR  FAMILIES  OF  DISTRIBUTIONS 

To  this  point,  we  have  developed  asymptotically  optimal  procedures 
for  certain  ranking  goals,  for  situations  in  which  the  joint  density 
function  of  the  observations  satisfies  certain  mild  regularity  conditions 
imposed  in  Section  2.1.  There  are  many  functions  which  do  not  satisfy 
these  regularity  conditions,  but  which  occur  frequently  in  practice. 

The  general  method  developed  in  previous  chapters  to  obtain  an 
asymptotically  optimal  procedure  for  certain  ranking  goals,  using  the 
regularity  conditions,  can  often  be  used  for  the  non- regular  cases  too. 
However  each  non-regular  case  must  be  treated  separately.  The  ba.ic  idea 
of  using  local  Bayes  rules  to  develop  asymptotically  optimal  procedures 
in  the  presence  of  several  nuisance  parameters  would  be  useful  in  all  such 
cases,  the  actual  analysis  being  different  in  each  case  (due  to  say, 
different  normalizing  constants  and  the  actual  functional  forms  of  the 
distributions) . 

In  order  to  illustrate  the  applicability  of  our  method  to  non¬ 
regular  cases,  we  consider  some  particular  non- regular  density  functions 
and  develop  asymptotically  optimal  procedures  for  the  ranking  goals 
considered  earlier  in  this  thesis  for  regular  cases. 

Before  we  consider  such  problems,  we  list  certain  non- regular 
density  functions  which  may  occur  frequently  in  practical  situations. 

The  list  contains  most  of  the  interesting  known  cases.  Most  of  these 
are  taken  from  Weiss  and  'Volfowitz  ([51J  and  [54]). 


-98- 


-99- 


4.1  Some  Non-Regular  Density  Functions 

Let  f(xje)  denote  the  density  function  of  a  random  variable  X, 
characterized  by  parameter  0  (=  (0i,02»...,0  ))»  which  in  general  may 
be  multidimensional.  Let  0  denote  the  parameter  space,  a  suitably 
defined  subspace  of  m  dimensional  Euclidean  space  (m  being  the  dimension 
of  9).  Whenever  m  =  1,  the  parameter  is  denoted  by  6  and  for  m  >  1, 

9  denotes  the  vector  of  parameters,  each  component  being  denoted  as 
0i#  i  =  1,2, ...,m. 

The  following  constitute  some  interesting  "non-regular"  univariate 
density  functions. 


I  f(x | ©}  =  0  for  x  <  0j 

f(e1+|e)  =  h(0)  >  0 

where,  0  =  { (e1*°2^  1“°°  <  ei  <  °°»  0  <  02  < 


Example 


(4.1) 


f(x|9) 


_  / 


r  <x-v 

!_  e"  02 
e2 


for  x 


otherwise 


and  0  =  { (Gj ,92)  I-00  <  <  °°»  0  <  ©2  <  °°}. 


II 


if  x  <  9  or  x  >  B(8) 

g(0)  >  0 


f(x|e)  =  0 

f(0+|e)  = 

f(B(9)-|9) 


h(e)  >  o 
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g,  h  and  B*  =  are  continuous  functions  of  6.  Also 
(Note  that  I  is  a  special  case  of  II  with  3(6)  =  ®).  0 

defined  for  each  particular  case. 


Examples  (i)  B'  <  0 


(4.2)  f (x | 6) 


— ~  for  6  <  x  <  1/9 

ue 

0  otherwise 


where, 


e  =  { e | o  <  e  <  1} 


(ii)  B*  >  0 


r 


(4.3) 


f(x|e)  = 


1  ♦  j  (x-e)  for  e  <  x  <  e 


otherwise 


where, 


0  =  {  0  |  -CO  <  0  <  »} 


(iii)  One  of  the  end  points  is  a  constant;  i.e 


This  case  can  be  represented  by  the  following: 


(4.4) 


for  0  <  x  <  6 


otherwise 


Majasaas 


B(6)  >  e. 
is  suitably 


♦  2/3 


B'  =  0. 


where,  0  =  { 0 1 0  <  0  <  °°} 


m 

B  for  0  x  0 

(4.5)  f(x|0)  =  / 

for  0  <  x  <  1 

V.  0  otherwise 

where  B  is  a  known  constant  and 

0  =  { 0 10  <  0  <  1} 

IV  f (x |0)  =0  x  <  0j  or  x  > 

f(Y!0)  a  gCC©1.02:))  >  0 

f(e2-|e)  «  h((e1,02))  >  o 

g,h  are  continuous  functions  of 

Example 

(4.6) 


where. 
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V  Double  Exponential  Density  (Laplace  distribution) 

**  ^  |  x**  0  | 

(4.7)  f (x 1 6)  *  e  °2  1  for  <  x  <  - 

where , 

o  *  ((e1§e2)  |—  <  0j  <  *,  o  <  e2  <  -) 


VI  Weibull  Distribution 


(4.8) 


F(x|0) 


for  x  <  0. 


x-e 


1/9, 


I  -  e 


for 


x  1  61 


where. 


x 

F  (X 1 6)  *  /  f(y|e)dy,  is  the  cdf  of  the  Weibull  distribution. 


Also, 


0 


<  0j  <  -,  o  <  e2 


®,  0  < 


For  this  class  of  distributions,  is  the  "location"  parameter, 

is  the  "scale"  parameter  and  03  is  the  "shape"  parameter.  If  6^=1, 
we  obtain  the  exponential  density  function  (as  given  by  (4.1))  a  special 
case  of  the  class  of  Weibull  distribution.  If  0j  is  known,  (4.8)  reduces 
to  the  regular  case. 

Before  proceeding  to  develop  optimal  procedures  for  some  of  the 
density  functions  listed  above,  we  review  the  literature  in  ranking  and 
selection  procedures  for  non- regular  cases. 
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4.2  Brief  Review  of  Literature  on  Ranking  Problems  in  Non-Regular  Cases 

Sobel  [43]  developed  sequential  procedures  for  ranking  scale 
parameters  of  exponential  populations,  (as  given  by  (4.1)  in  Section  4.1 
and  02  is  the  scale  parameter)  with  known  location  parameters  (same  or 
different  for  each  population)  or  a  common  unknown  location  parameter. 

In  the  case  of  a  common  unknown  location  parameter,  Sobel  assumed  that 
if  the  unknown  location  parameter  was  greater  than  or  equal  to  zero, 
zero  is  taken  to  be  the  value  of  the  unknown  location  (nuisance)  parameter 
and  a  procedure  satisfying  the  basic  probability  requirement  was  developed. 

Bechhofer,  Kiefer  and  Sobel  [7]  developed  sequential  procedures 
for  ranking  problems  associated  with  the  Koopoan-Darmois  family  of 
distributions;  these  would  be  applicable  to  ranking  scale  parameters  of 
exponential  populations  when  the  location  parameters  are  known. 

Barr  and  Rizvi  [2]  developed  a  simple-stage  procedure  for  ranking 
uniform  distributions  (density  function  as  given  by  (4.4)  in  Section  4.1). 
Using  a  zero-one  type  loss  function  (as  we  have  done  throughout),  Barr 
and  Rizvi  showed  that  the  procedure  they  developed  is  minimax  and  is  a 
most-economical  decision  rule  (in  the  sense  of  Hall  [25]).  They  also 
showed  that  the  selection  procedure  may  easily  be  extended  to  a  larger 
class  of  non- regular  distributions,  given  in  Hogg  and  Craig  [27],  This 
class  of  distributions  corresponds  to  Case  II  in  Section  4.1. 

Dudewicz  [15]  in  determining  the  efficiency  of  a  non-parametric 
selection  procedure  given  by  Bechhofer  and  Sobel  [9]  (for  the  location 
parameter  case)  against  parametric  alternatives,  proposed  single  stage 
procedures  for  ranking  from  uniform  distributions  (uniform  between 
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(e-1/2,  e*l/2)).  (The  nonparametric  selection  procedure  of  Bechhofer 
and  Sobel  (9]  is  also  given  as  a  selection  procedure  in  Bechhofer, 

Elmaghraby  and  Morse  [6]).  Dudewicz  [15]  proposed  a  midrange  procedure 
(i.e.,  populations  are  ranked  according  to  the  values  of 

(max.  of  obs}»(min.  of  obs)^  an(j  a  means  procedure  (i.e.,  populations 

are  ranked  according  to  the  values  of  the  sample  means)  for  ranking  from 
uniform  distributions. 

Mahamunulu  ([32],  [33])  proposed  a  single-stage  procedure  for 
ranking  k  populations  {IT,  i  «  1,2, ...,k),  the  selection  procedure 
based  on  suitable  statistics  T^.T^,...,^.  is  computed  from  a  random 

sample  of  size  n  from  IT  (i  ■  l,2,...,k).  The  proposed  procedure 
is  applicable  to  ranking  populations  according  to  Goal  I  (defined  earlier 
in  Chapter  3)  and  for  situations  in  which  ^  is  an  absolutely  continuous 
random  variable  and  its  distribution  function  is  stochastically  increasing 
in  the  parameter  being  ranked  (see  Lehmann  [30]  for  definition  of  stochastic 
increasing  family  of  distributions). 

The  procedure  developed  by  Mahamunulu  is  applicable  to  ranking 
problems  associated  sith  the  following  non- regular  families  of  distributions 
(i)  uniform  distribution  (Case  II  in  Section  4.1)  (ii)  exponential 
distribution  (Case  I  in  Section  4.1)  and  (lii)  double  exponential 
distribution  (Case  V  in  Section  4.1).  The  proposed  procedures  are 
applicable  when  no  nuisance  parameters  are  present  (or  they  are  arbitrarily 
removed  from  explicit  consideration).  Mahamunulu  also  indicated  how  the 
tables  developed  for  the  case  of  ranking  means  of  normal  distribution  with 
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common  known  variances  could  be  used  as  an  approximation  for  ranking  from 
other  distributions,  whenever  the  statistics  used  for  ranking  the  popula¬ 
tions  ("T)  converged  asymptotically  to  a  normal  distribution. 

Mahamunulu  ([33],  p.  1082)  pointed  out  that  if  a  sufficient  statistic 
of  fixed  dimension  for  all  n,  exists  (for  the  parameter  being  ranked), 
then  'T  is  some  appropriate  function  of  the  sufficient  statistic.  The 
choice  of  T\  becomes  a  problem  only  when  such  a  sufficient  statistic 
does  not  exist.  In  the  latter  case,  the  author  advocates  using  a  statistic 
such  that  the  induced  family  of  distributions  is  a  stochastically  increasing 
family  of  distributions. 

4.3  Asymptotically  Optimal  Procedures  for  Ranking  Non-Regular  Exponential 
Distributions 

We  now  proceed  to  develop  asymptotical ly  optimal  procedures  for 
certain  ranking  problems  associated  with  populations  having  non-regular 
exponential  distributions.  We  develop,  in  detail,  asymptotically  optimal 
procedures  for  selecting  the  best  population  (defined  appropriately).  We 
then  develop  an  asymptotically  optimal  procedure  for  other  ranking  goals 
considered  earlier  in  this  thesis.  Later,  we  consider  non- regular  uniform 
density  functions  and  develop  asymptotically  optimal  procedures  for  the 
above  mentioned  ranking  goals.  The  method  of  analysis  used  in  this  section 
carries  over  to  most  of  the  non-regular  cases. 

Former tion  of  the  Problem 

We  formulate  the  problem  in  decision  theoretic  structure  using  the 
notation  defined  in  Chapter  2, 
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Let  X  ^  i  =  l,2,...,n  denote  n  independent  observations  from 
nt,  t  =  1,2, ...,k,  each  with  pdf  f  ( • I  • )  given  as 

x’Lt 

T 


r  -(^ 
l 


(4.9) 


ft(x|6)  -  f(x|Lt,St)  -(  1 


*lLt 


otherwise 


where  Lt,St  denote  the  two  unknown  parameters  of  the  distribution 
characterizing  population  n  .  Here  L^  denotes  the  location  parameter 
and  S(,  the  scale  parameter. 

Two  cases  of  interest  arise  here. 

(i)  is  the  parameter  being  ranked;  is  the  nuisance  parameter. 

(ii)  1/St  is  the  parameter  being  ranked;  is  the  nuisance  parameter. 

Each  case  is  treated  separately  below.  We  first  define  some 
additional  notation. 

Case  (i)  St  is  the  common  unknown  nuisance  parameter 
For  t  ■  1,2, ...,k,  let 


(4.10) 


li  a  L 
\  t 


(4.11) 


S.  -  81 


and 


*t  ■  e0  '  6l*t 


(4.12) 
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Also  for  i  =  1,2,. ..,k,  let 


r 

Vi  = 

Vi  -  c/n 

(4.13) 

H. 

l 

=  ) 

i  6l*j  = 

Vj  4  c/n 

j  =  l,2,...,k 

L 

t  i 

where  c  >  0 

and 

e,  .  =e,  . , 

l+i  l+j* 

all  i ,  j  ( 

i,j  ~  1 , 2, • . .  ,k) 

ire  known 

values,  0^^  is  taken  to  be  zero,  with  no  loss  in  generality. 

The  above  relations  imply  that  we  are  ranking  location  parameters, 

with  the  common  unknown  scale  parameter  being  the  nuisance  parameter. 

In  the  parameter  configuration  given  by  (4.13),  selecting  IT  as 

the  true  hypothesis  is  equivalent  to  selecting  IT  as  the  best  population 

(i  =  l,2,...,k).  The  above  hypotheses  also  imply  that  the  best  population 

2c 

has  a  4--value  which  is  units  greater  than  that  of  the  remaining 

(k-1)  populations.  This  corresponds  to  the  least  favorable  configuration 
(LFC)  for  the  ranking  problem  under  consideration  under  the  indifference 
zone  approach. 

For  i  «  1,2,...  ,k,  let  denote  the  decision  to  select  IT 

as  the  best  population.  The  loss  function  for  the  k-decision  problem 
is  given  as 


(4.14) 


W(e;D.) 


0  if  IT  is  the  true  hypothesis 


L 


otherwise 
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Asymptotically  optimal  decision  rules  (procedures)  will  be  developed 
for  the  above  problem. 


Case  (ii)  is  the  common  unknown  nuisance  parameter 


For  t  =  1.2. 

v 

let 

(4.15) 

*t  = 

i/st 

(4.16) 

h 

*  61 

and. 

(4.17) 

Ct  * 

eo  ‘  Vt 

Also  for  i  * 

1.2... 

. ,k,  let 

Vi 

=  Vx  * 

c//n 

(4.r.) 

H. : 

l 

Vj 

*  v,  * 

c/ /n  j  ■ 

i  i 

where  c 

>0,  6. 

-  Vi 

all  i. 

j  (i,j  =  1,2, ...,k)  are  known  constants 

0,  .  is 
1*1 

taken  to  be  zero. 

with  no 

loss  of  geneiulity. 

In 

this  case 

0j,  the  common 

unknown  loc;ii'on  parametei. 

is  the 

nuisance 

parameter 

for  the 

problem 

of  ranking  scale  parameters. 

Here, 

(4.18)  represents  t lie  least  favorable  configuration  for  the  ranking 
problem  under  consideration. 
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For  i  =  l,2,...,k,  let  D.  denote  the  decision  to  select  n. 
as  the  best  population.  The  loss  function  for  the  k-decision  problem 
is  given  by  (4.14). 


Preliminary  Sequence  of  Artificial  Problems:  S»  is  the  nuisance  parameter 

Before  proceeding  to  the  real  problem,  we  solve  a  sequence  of 
artificial  k-decision  problems  (one  for  each  n)  for  case  (i). 

Suppose  it  is  known  that  0^  =  0^  and  satisfy 

one  of  the  k  hypotheses  given  by  (4.13).  6^  is  a  known  constant  and 
0  <  c  L.  We  wish  to  test  which  one  of  the  k  hypotheses 
(given  by  (4.13))  is  the  true  hypothesis.  The  loss  function  is  of  the 
zero-one  type,  given  by  (4.14). 

For  the  above  problem,  we  construct  a  Bayes  decision  rule  relative 
to  the  following  apriori  distribution: 

For  j  ■  1,2, ...,k,  b.  is  the  apriori  probability  that  H.  is 

^  k  ^ 

true  hypothesis,  where  for  j  *  l,2,...,k,  b.  ^0  and  [  b.  *  1. 

J  j*l  J 

In  order  to  obtain  a  Bayes  decision  rule,  we  must  compute  for 
i  =  l,2,...,k,  k(D^;x),  and  it  can  be  easily  seen  that  relative  to  the 

above  apriori  distribution,  and  the  loss  function  given  by  (4.14),  a 
Bayes  decision  rule  reduces  to  the  following: 

For  *  «  1,2,... ,k,  select  Hf  as  the  true  hypothesis  (equivalently 
n  as  the  best  population)  if 


(4.19) 


v1”  i  ^ 


j  *  1.2, 
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where,  for  £,j  =  1,2, 

n 

n  ftx.jej,^) 

(4.20)  J„U|j)  =  y  — — - 

n  f(X.;6.,H  ) 

i=l  1  1  J 

Here  f  (X ^ ; 9”^ , )  denotes  the  joint  pdf  of  »X2i *  *  *  *  ,Xki ^  ’  when 

(6 1 ,02, . . . » 6^^ i )  is  the  true  parameter  value  for  X  and  (02» ®3*  •  •  • 
satisfy  ,  as  given  by  (4.13). 

Since  we  assume  that  the  observations  are  independent, 

k 

(4.21)  f (X  ;0,H  )  *  n  f  (X  ;6,H  ) 

1  J  i*l  1  11  3 

where  *t^Xti;6,Hj^  is  given  by  (4.9). 

Using  (4.9)- (4.12)  and  (4.21),  for  j  -  1,2, ...,k 

n  k  n 

(4.22)  n  f(X  ;9,H  )  -  n  (  n  f(X  ;6,H  )) 

i»l  J  t*l  i*l  } 

where,  for  t  i  j 


(4.23)  H  ft(Xti;6,H  ) 

1*1  J 


if  min  lxti )  <  -  c/n 

1  <  i  <  n 


'  1  r 

hi  exP(-  2T  I  otherwise 

1  01  i  =  l  tl  ° 


and  for  t  *  j 
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(4.24)  n  ft(xti;e,Ht)  = 
i  =  l 


if  min  (X  .)  <  0q  ♦  c/n 

l<i<n 


1  11 

e‘n  exp(-  —  [  (Xti-eQ-c/n))  otherwise 
°1  i=l 


For  t  =  1,2, ...,k,  let 


(4.25) 


V  =  min  (X  . ) 
l<i<n 


and  denote  the  ranked  values  of  V  by 


(4.26) 


Y[l]  i  Y[2]  i  •••*  i  Y[k] 


Using  (4.21)- (4.25) ,  we  obtain,  for  j  =  l,2,...,k. 


e  _nk  exp[-  I  l  (X  -e  )]exp[-  (k-2)] 

1  t  =  l  i*l  k  nOj 

if  (Y.  >  eQ  ♦  c/n)  H  (Y^  >  eQ  -  c/n) 

J  i  - 1  1  ‘ 


k  n 


(4.27)  n  f(x.;6 ,.H  ) 

i  =  l  J 


otherwise 


If  we  denote,  for  j  =  l,2,...,k# 


(4.2S) 


6^  (n) 


•  b  1  f(.v  e  .:;,) 
J  i-l  1  1  ' 


then  tl'.e  Baves  decision  rale  mav  be  rewritten  as  follows: 
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For  e  =  l,2,...,k,  select  as  best  population  if 


(4.29) 


b£  (n)  1  bj(n) 


j  t  * 

j  ”  1,2, ... ,k 


From  (4.27),  we  note  that  for  any  given  set  of  observations,  the 
joint  density  function  under  each  FF  is  the  same,  whenever  the  joint 
density  function  is  positive  under  each  FF.  For  certain  sets  of  observa¬ 
tions,  the  joint  density  function  is  equal  to  zero  under  each  IF  (such 
cases  represent  regions  of  probability  zero  and  may  be  ignored  from 
consideration  in  constructing  Bayes  decision  rules).  For  the  remaining 
sets  of  possible  observations,  the  joint  density  function  may  be  zero 
under  some  hypotheses  and  equal  to  the  same  positive  value  under  the 
remaining  hypotheses  (as  is  clear  from  (4.27)). 

Since  we  are  interested  in  an  optimal  (minimax)  decision  rule  for 
the  artificial  sequence  of  problems,  we  need  not  rewrite  (4.29)  to 
develop  simplified  expressions  for  a  Bayes  decision  rule  for  any  general 
apriori  distribution.  We  note,  by  the  inherent  symmetry  in  the  problem 
that  a  minimax  decision  rule  will  be  given  by  the  Bayes  decision  rule 
with  the  prior  given  by 


(4.30) 


j  ■  l,2,...,k 


For  this  prior,  the  Bayes  decision  rule,  which  is  a  minimax  decision 


rule  too,  is  not  unique  because  of  ties  in  the  b^  (n)  (as  defined  by 
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(4,28)).  A  minimax  decision  rule  is  given  by  the  following: 

Select  as  the  best  population  if 

(4.31)  Y  >  Y  j  =  1,2, ...,k 

e  ~  J  t  i 

For  the  minimax  decision  rule  given  by  (4.31),  one  is  interested 
in  computing  the  probability  of  correct  selection.  This  will  be  needed 
in  showing  that  the  decision  rule  for  the  real  problem  is  asymptotically 
minimax. 

Note  that  if  X  .  (i  =  l,2,...,n)  have  a  distribution  given  by 
(4.9),  then 


or, 

(4.32) 


P(nY  ^  y)  =  P{  min  X  .  <_  y/n) 

l<i<n  Z1 


*  1  -  P{  min  X  .  >  y/n) 

l<i<n  U  _ 


=  1-11  (e”(y/n  '  *t)/St) 

i  =  l 

-y/St  /s 

P(nYt  <  y)  =  1  -  e  e  1  1 


From  the  above,  it  follows  that 

p{ruYt  "  V  i  y*  = 


(4.33) 


1  -  e 


-114- 


thus  from  (4.12),  (4.13)  and  (4,31)  and  denoting  the  common  known 
St  wc 

P(CS)  -  P(Yx  <_  Y(kj  i  •  1,2,..  ,,k ) 

where  ^(k)  15  °^5crvati°n  from  the  population  with  largest 

Rearranging,  we  obtain 

P(CS)  •  P( n( Y ^  -  (eQ  -  c/n)  <  n(Y^k)  *  (fl0  ♦  “)  ♦  2c  i  =  1,2,. ..,k 

■  P(Zi  <_  Z(k)  ♦  2c  all  i ) 
where  the  distribution  of  Z.  is  given  by  (4.33). 

Thus, 

P(CS)  .  /  |  1 

°\ 

■  if 

Denoting  e  by  u, 

P(CS)  «  /  (1  -  du 

0 


(4.34) 
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where. 


(4.35) 


c*  = 


2c 
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Evaluating  the  right  hand  side  of  (4.34),  one  can  determine  the 
P(CS)  achieved  for  the  optimal  procedure  for  any  given  sample  size  n. 

Asymptotically  Optimal  Procedure  for  the  Real  Problem 

The  above  sequence  of  problems  was  artificial  because  we  assumed 
was  known.  We  now  develop  an  asymptotically  optimal  procedure  for 
the  real  problem,  where  nothing  is  known  about  the  value  of  e^. 

Since  the  minimax  procedure  for  the  artificial  problem  did  not 
use  the  infort.'ition  about  9^  it  follows  immediately  that  an  asymptotically 
optimal  j  euure  for  the  problem  is  given  by  the  following  theorem. 

Theorem  4.1  An  asymptotically  minimax  procedure  for  selecting  the  best 
population  is  given  by  the  following: 

Select  ll2  as  best  population  if 

>_  Yj  i,  j  *  1 ,2, . . .  ,k 

j  i  l 

where  Y£  is  as  defined  by  (4.25). 

Proof :  The  decision  variables  have  the  same  asymptotic  distribution  as 
the  decision  variables  of  the  minimax  (and  Bayes)  decision  rule  for  the 


the  artificial  problem.  Thus  the  above  procedure  has  the  same  asymptotic 
Bayes  risk  and  is  hence  an  asymptotically  minimax  decision  rule. 


Q.E.D. 

It  is  interesting  to  note  here  that  unlike  the  regular  case,  where 
local  Bayes  rules  were  used  to  construct  an  asymptotically  optimal  proce¬ 
dure,  in  the  non-regular  case  being  considered  use  of  a  simple  Bayes  rule 
for  the  artificial  problem  (where  the  nuisance  parameter  is  known  with 
certainty)  enables  one  to  construct  an  asymptotically  optimal  ranking 
procedure. 


Preliminary  Sequence  of  Artificial  Problems:  is  the  Nuisance  Parameter 


We  first  solve  a  sequence  of  artificial  problems  ^one  for  each  n) , 
before  proceeding  to  the  real  problem  for  case  ( i i )  (i  .,  (4.15)  -  (4.18) 
hold). 

Suppose  it  is  known  that  0^  -  (T,  and  (e2,0j,...,ek+1)  satisfy 
one  of  the  k  hypotheses  given  by  (4.18).  0^  is  a  known  constant  and 

0  <  c  <_  L,  Ke  wish  to  test  which  one  of  the  k  hypotheses ,  given  by 
(4.18),  is  the  true  hypothesis;  and  the  loss  function  is  of  the  zero-one 
type,  given  by  (4.14). 

for  the  above  problem,  we  construct  a  Bayes  decision  rule  relative 
to  the  following  apriori  distribution: 


For  j  »  1,2,. ...k,  b. 

J 

true  hypothesis,  where  for  j 


is  the  apriori  probability  that  H.  is  the 

k  ^ 

=  1,2,.,. , k,  b.  >  0  and  £  b .  •  1 . 

J  j  =  l  } 
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It  can  be  seen  that  for  the  above  problem,  a  Bayes  decision  rule 
relative  to  the  above  prior,  reduces  to  the  following: 

For  i  =  l,2,...,k  select  as  the  true  hypothesis  (equivalently 

n  as  best  population)  if 

b . 

(4.36)  J  (i|j)  i  h1  j  *  1»2****»k 

SL  t  £ 


where,  for  t,j  =  l,2,...,k 


n 

n  fCXjjej.H^) 

(4.37)  JnU|j)  =  - - — 

n  fCx.jGj.H.) 
i  a  1  i 


Here  f(X^;0j,H  )  denotes  the  joint  pdf  of  (Xji >*2i *  *  * ' ,Xki^  when 
(0 j ,02, .  • .  *®k+ is  the  true  Parameter  point  and  C©2» e3*  •  •  •  ^ 

satisfy  H  ^ ,  as  given  by  (4.18), 

From  (4.9),  it  follows  that 


(4.38) 


r  (0n+c//n) (en-c//n)k_1  exp 


."^(vw  s  < 


« ■ 

-  (e0^c//n)  l  (X  -e  ) 
i  =  l  J 

n  k 

-(o0-c/*n)  l  l  (X  -e  ) 
i=l  t*l 

if  y  >  6j 


0 


otherwise 
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where. 


(4.39) 


Y  =  min  {X  } 
l<t<k 
l<i<n 


From  (4.38),  we  get 


(4.40) 


Jn(£|j)  =  g-  if  y  1  0j 


which  occurs  with  probability  zero  and  can  be  ignored; and 


(4.41) 


J  (n|j)  =  exp{2c(Z.-Z  )}  otherwise 


n'  - - j 


where,  for  l  -  1,2, ...,k. 


(4.42) 


■  j,  <x«i  -  V//ir 

i«l 


Thus  the  Bayes  decision  rule  reduces  to  the  following: 

For  i  *  1,2, ...,k  select  as  the  best  population  if 


(4.43) 


b. 

Z„  <  Z.  -  ~  log  r-l 
*  -  J  2c  B  bt 


j  =  1,2, ...,k 
*  L 


In  order  to  develop  a  minimax  procedure  for  the  problem,  due  to  the 

inherent  symmetry  in  the  problem,  the  prior  {b.  -  r  j  =  1.2, ...,k)  gives 

J  * 

a  minimax  decision  rule.  Thus,  a  minimax  decision  rule  for  the  artificial 
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problem,  is  given  by  the  following: 

Select  as  the  best  population  if 

(4.44)  Z  <  Z  £,j  =  l»2,...,k 

1  3  M  j 

The  minimax  decision  rule  is  completely  specified  by  (4.44);  but  we 
need  to  study  the  asymptotic  behavior  of  the  decision  variables  in  order 
to  develop  an  asymptotically  optimal  procedure  for  the  real  problem. 

For  £  =  1,2, ...,n  and  when  e"  is  the  common  location  parameter, 
we  have 


(4. 45) 


E (X  .  -  6.)  =  S„ 

£l  1  £ 


(4.46) 


VarXu  -  S/ 


and  by  a  central  limit  theorem. 


f  n 

X  (x£i  -  V 


(4.47) 


i  =  l 


lim  P\/n 


n-*» 


i 

l 


Yr  1  -z2/2 

J  -  e  dz 


I  X 


-•»  /W 


Thus  (4.47)  could  be  used  for  the  asymptotic  distribution  of  the 


decision  variables 
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Optimal  Procedure  for  the  Real  Problem 

The  above  sequence  of  problems  was  artificial  because  we  assumed 
that  0 ^  was  known.  We  now  develop  asymptotically  optimal  procedure 
for  the  problem,  where  nothing  is  known  about  0^. 

a 

Let  0j(n)  a  consistent  estimator  of  0^  be  defined  as 

(4.48)  9  (n)  =  min  (X  . } 

i<t<k 

l<i<n 

where  e^n)  satisfies 

(4.49)  P  (n| 0i (n)  -  9  |  <  D(e)}  >  1  -  e 
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(H  is  true) 

where  (H  is  true)  implies  one  of  the  p  hypotheses  given  by  (4.18)  are 
true. 

For  l  -  1,2, ...,k,  let 

n 

(4.50)  Z  =  —{  l  (X  .  -  e  (n))) 

/n  i  *  1 

Then,  an  asymptotically  optimal  decision  rule  is  given  by  the  following 
theorem. 

Theorem  4.2  An  asymptotically  optimal  decision  rule  for  the  problem  of 
selecting  the  best  population  (largest  scale  parameter)  is  given  by: 

For  i,j  *  1,2, ...,k,  select  Jl  as  the  best  population  if 
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(4.51) 


A 

z. 

1 


j  /  l 


Proof :  It  is  easy  to  see  that 


(4.52) 


z  -  z.  =  z  -  z. 

1  3  4  J 


.  n  n 

=  —  (  V  X  .  -  j  X..) 
/fT  i  =  l  U  1  =  1  ^ 


Thus  the  decision  variables  given  by  (4.51)  have  the  same  asymptotic 
distribution  as  the  minimax  (and  Bayes)  decision  rule  for  the  artificial 
problem.  Thus  the  decision  rule  given  by  (4.51)  is  an  asymptotically 
minimax  decision  rule  for  the  problem. 

Q.E.D. 

A 

It  may  be  noted  that  we  could  easily  have  defined  Z  by 

& 

1  n 

—  £  X  .  However  the  above  definition  (i.e.,  Z  given  by  (4.50)) 

/a  i=i  £1 

was  intentional  since  it  allows  us  to  observe  what  the  optimal  decision 
rule  would  look  like  when  the  populations  have  different  unknown  location 
parameters  as  the  nuisance  parameters  for  the  problem.  In  such  a  situation, 
if  we  let,  for  t  =  1,2, ...,k 


(4.54) 
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then  an  asymptotically  optimal  procedure  is  given  by  Theorem  4.2  with 
Zt  as  defined  by  (4.54)  (instead  of  (4.50)).  We  again  note  in  this 
case  that  by  using  a  simple  Bayes  rule  one  is  able  to  construct  an 
asymptotically  optimal  ranking  procedure. 

Optimal  Procedures  for  Other  Ranking  Goals 

In  Chapter  3,  we  considered  two  general  ranking  goals  which  have 
been  considered  in  the  literature,  and  developed  asymptotically  optimal 
procedures  for  situations  in  which  certain  regularity  conditions  hold. 

We  now  develop  an  optimal  procedure  for  the  two  general  ranking  goals 
considered  in  Chapter  3,  for  the  case  of  the  non-regular  exponential 
density  functions. 

Instead  of  repeating  the  detailed  analysis  of  Chapter  3,  we  will 
state  the  optimal  ranking  procedures  for  each  goal  in  terms  of  the 
following  theorems  (using  the  notation  defined  earlier  in  the  chapter). 

Case  (i)  is  the  ranking  parameter;  St  is  the  common  unknown  nuisance 

para  : ter 

Asymptotically  optimal  procedures  for  Goal  I  and  Goal  II  are  given 
by  the  following  theorem: 


Theorem  4.3  (i)  An  asymptotically  optimal  procedure  for  Goal  I,  (to 

select  s  populations  to  contain  at  least  d  of  the  t  best  populations) 
is  to  select  the  s  populations  associated  with  i]’‘**'^[k  s+1]* 
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(ii)  An  asymptotically  optimal  procedure  for  Goal  II  (selecting 
t  best  ordered  populations,  1  <  t  <  k)  is  to  select  the  t  populations 
associated  with  l]****,y[k  t*l]  aS  the  best»  tbe  seconti  best> 

etc.,  the  ttb  best  population  respectively. 

Case  (ii)  1/S^  is  the  ranking  parameter;  is  the  common  unknown 

nuisance  parameter 

Asymptotically  optimal  procedures  for  Goal  I  and  Goal  II  are  given 
by  the  following  theorem: 


Theorem  4.4  (i)  An  asymptotically  optimal  procedure  for  Goal  I  is 

to  select  populations  associated  with  ^[k]’^[k  s  +  1]  aS  tbe 

best  set  of  populations. 

(ii)  An  asymptotically  optimal  procedure  for  Goal  II  is  to  select 

a  A  A 

populations  associated  with  Z^j,Z^  ^,...,2^  t*l]  aS  the  best»  the 

th 

second  best,  etc.,  the  t  best  populations,  respectively. 

A 

In  this  case  Zf  is  as  defined  by  (4.50)  and  the  ordered  values 
are  denoted  by 


(4.55) 


Z[l]  ‘  Z[2]  ‘  ‘  Z[kl 


One  may  note  here  that  Theorem  4.4  could  be  generalized  to  the  case 

A 

of  different  (unknown)  location  parameters  L  (by  replacing  Z  as 

l  « 

A 

defined  by  (4.50),  by  Z^  as  defined  by  (4.54)). 


i 
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4.4  Asymptotically  Optimal  Procedure  for  Ranking  Non-Regular  Uniform 
Distributions 


We  now  develop  asymptotically  optimal  procedure  for  a  non-regular 
uniform  distribution. 

Let  X^,  i  =  l,2,...,n  denote  n  independent  observations  from 
nt,  t  =  l,2,,..,k,  each  with  pdf  f  (•(•)  given  as 


r 


(4.56)  ft(x|0)  =  f(xl0it.32t)  S\ 


02t‘eit 


if  elt  —  x  —  02t 


otherwise 


V. 


where  ®it»02t  denote  the  two  unknown  parameters  characterizing  population 

V 

Two  cases  of  interest  arise  here: 

(i)  02t  is  the  parameter  being  ranked;  6^  is  the  nuisance 
parameter. 

(ii)  0j  is  the  parameter  being  ranked;  is  the  nuisance 

parameter. 

The  two  cases  are  in  a  sense  very  similar  and  a  solution  to  one 
would  suggest  a  solution  to  the  other.  It  may  also  be  noted  that  the 
analysis  for  each  of  the  two  cases  of  interest  is  very  similar  to  the 
analysis  in  Section  4.3  for  ranking  location  parameters  (L^)  of 
non- regular  exponential  populations,  when  S  is  the  nuisance  parameter. 

To  avoid  repetition,  we  omit  the  detailed  analysis  and  the  asymptotically 
optimal  procedures  are  given  in  terms  of  the  following  theorems,  which 
we  state  without  proof. 


i 
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We  first  define  notation  to  be  used  in  the  statement  of  the  theorems. 
For  t  =  1,2, ...,k,  let 


(4.57) 


U  =  -in  (Xti) 

l<i<n 


(4.58) 


V  =  «ax  (X  ) 
l<i<n 


and  let  the  ranked  values  of  and  V  be  denoted  by 


(4.59) 


U[l]  *  U[2]  <  <  U(k] 


(4.60) 


v[i]  <  v[2]  <  ....  <  v[k] 


An  asymptotically  optimax  procedure  for  selecting  the  best  popula¬ 
tion  is  given  by  the  following  theorem. 


Theorem  4.5  (i)  An  asymptotically  optimal  procedure  for  selecting  the 

population  associated  with  the  largest  0^  is  to  select  the  population 
associated  with  V^. 

(ii)  An  asymptotically  optimal  procedure  to  select  the  population 
associated  with  the  largest  0^  is  to  select  the  population  associated 
with  U[k). 


i 


% 

4 
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For  the  two  general  goals  considered  in  Chapter  3  (Goal  I  and  Goal 
II),  asymptotically  optimal  procedures  for  the  problem  at  hand  are  as 
given  by  the  following  theorems: 


Case  (i)  02t  is  the  ranking  parameter 


Theorem  4.6  (i)  An  asymptotically  optimal  procedure  for  Goal  I  (to 

select  s  populations  to  contain  d  of  the  t  best  populations)  is 
to  select  the  s  populations  associated  with  V^_s+ij  »V[k-s]  *  *  *  *  *V[k] 

(i i )  An  asymptotically  .•'ptimal  procedure  for  Goal  II  (selecting 
t  best  ordered  populations,  1  <  t  <  k)  is  to  select  the  populations 


associated  with  V[)c]»V[k  i]*****V[k-t+l]  as  the  best*  second  besti 
tth  best  population  respectively. 


Case  (ii)  eit  is  the  ranking  parameter 


Theorem  4,7  (i)  An  asymptotically  optimal  procedure  for  Goal  I  is  to 

select  s  populations  associated  with  as  the 

best  set  of  populations. 

(ii)  An  asymptotically  optimal  procedure  for  Goal  II  is  to  select 
the  populations  associated  with  *^[k-l] ' ’  *  * ,lJ[k-t^l)  as  tbe  best» 

second  best,...,^  best  population  respectively. 

Since  the  purpose  of  this  chapter  was  only  to  indicate  how  the  method 


used  for  developing  asymptotically  optimal  procedures  for  regular  cases 
could  be  used  for  the  non-regular  cases  as  well,  we  do  not  indicate  the 
optimal  procesures  for  the  Laplace  and  Weibull  distributions  (the  two 
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remaining  non-regular  density  functions  listed  in  Section  4.1).  The 
method  used  in  Sections  4.3  and  4,4  Mould  be  used  for  these  cases  too 
and  the  analysis  is  very  similar  to  the  one  outlined  in  Section  4.3. 
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